MATERIALS AND METHODS FOR COLORECTAL CANCER SCREENING,
DIAGNOSIS, AND THERAPY 

FIELD OF THE INVENTION 

The present invention relates generally to methods and materials for
altering colorectal cancer progression. The present invention also relates to
techniques for screening for colon cancer and/or premalignancies.

BACKGROUND 

The transcription factor Prox-1 is expressed in a number of tissues
during embryonic development, including lens fiber cells, a subpopulation of neurons in
the brain and neural tube, skeletal muscle, heart, liver, pancreas, and lymphatic
endothelial cells. Targeted inactivation of Prox-1 results in defects of eye
development because of the failure of lens fiber cells to elongate (Wigle et al., Nat.
Genet. 21: 318-22, 1999). Prox-1 is also necessary for the migration of hepatocytes
during liver development (Sosa-Pineda et al., Nat. Genet. 25: 254-5, 2000). In
addition, Prox-1 deficient embryos lack lymphatic vasculature, while blood vessel
development is not affected (Wigle et al., Cell 98: 769-778, 1999).

Recently, others and we have demonstrated the essential role of Prox-1
in the regulation of the lymphatic endothelial phenotype. Overexpression of Prox-1 in
blood vascular endothelial cells, where it is otherwise absent, leads to increased
expression of lymphatic endothelial markers and to suppression of the genes
characteristic of the blood vascular endothelial lineage (Petrova et al., Embo J. 21:
4593-9, 2002; Hong et al., Dev. Dyn. 225: 351-7, 2002).

Notch is a transmembrane protein that acts as a receptor in a cell-cell
signaling mechanism, and in combination with other cellular factors, influences
differentiation, proliferation and apoptotic events at all stages of development
(Artavanis-Tsakonas, Science 284: 770-776, 1999). In animal models, mutations in
the Notch receptor have resulted in developmental abnormalities (Joutel et al., Nature
383: 707, 1996; Li et al., Nature Genet. 16: 243, 1997).

Cancer treatments generally promote tumor regression by inhibiting
tumor cell proliferation, inhibiting angiogenesis (growth of new blood vessels that is
necessary to support tumor growth) and/or prohibiting metastasis by reducing tumor
cell motility or invasiveness. Colon cancers are a very common malignancy and
are typically adenocarcinomas, or sometimes carcinoid tumors.
Treatment is primarily surgical resection of the colon, although chemotherapy has
been found to be beneficial in some cases. These treatment options for colon cancer
are of unpredictable and sometimes limited value, especially if the cancer has not
been identified and removed at early stages. There continues to exist a need for novel
therapies and diagnostic methods for cancer conditions.



SUMMARY OF THE INVENTION 

The present invention addresses one or more ongoing needs by 
providing materials and methods for screening for and treating cancerous and 
precancerous conditions, especially those that are colorectal in nature.

As one aspect, the invention provides materials and methods to screen 
a mammalian subject for a cancerous or precancerous condition based on analysis of 
Prox-1 expression in cells from the mammalian subject. In particular, materials and 
methods are provided for screening colon tissue for signs of cancerous or
precancerous pathology.

For example, the invention includes a method of screening colon tissue
for a pathological condition, said method comprising:

measuring Prox-1 expression in a biological sample that comprises
colon tissue from a mammalian subject, wherein elevated Prox-1 expression in the
colon tissue correlates with a pathological phenotype. The determination of elevated
Prox-1 expression is generally made by way of a comparison, e.g., to a measurement
of Prox-1 expression in healthy colon tissue (from the same subject or others of the
same species, preferably matched for sex, age, race, or other characteristics); or to a
measurement of Prox-1 expression in diseased (especially neoplastic) colon tissue.
When comparing Prox-1 expression in the colon tissue to Prox-1 expression in
healthy colon tissue, an increased (e.g., elevated) Prox-1 expression in the colon tissue
from the mammalian subject correlates with a pathological phenotype. When
comparing to diseased tissue, comparable levels of expression in the tissue from the
subject correlate with a pathological phenotype.

In another, related example, the invention includes a method of
screening colon tissue for a pathological condition, the method comprising steps of:
(a) obtaining a biological sample comprising colon tissue from a mammalian subject;
(b) measuring Prox-1 expression in the colon tissue; and (c) screening for the
presence or absence of a pathological condition from the measurement of Prox-1 in
the sample.

Similarly, the invention includes a method of screening colon tissue for
a pathological condition, the method comprising steps of: (a) obtaining a biological
sample comprising colon tissue from a mammalian subject; (b) measuring Prox-1
expression in the colon tissue; and (c) comparing Prox-1 expression in the colon
tissue to Prox-1 expression in healthy colon tissue, wherein increased Prox-1
expression in the colon tissue correlates with a pathological phenotype.

For this type of method, the term "pathological condition" is intended 
to include any abnormality or evidence of disease that warrants medical treatment or 
monitoring due to concern of developing disease. Cancers and precancerous changes 
in tissue are particularly contemplated. Thus, in preferred embodiments, the method
can be characterized as a screen for colon cancer or colorectal cancers, and increased
Prox-1 expression in the colon tissue is scored as being indicative of a cancerous or 
precancerous condition. 

The method can be combined with any other molecular, cellular,
pathological, or patient symptom criteria to assist a medical practitioner in early
diagnosis and therapeutic or prophylactic therapy. For example, in one variation, the
method further comprises measuring expression of at least one gene or protein
selected from the group consisting of CD44, Enc1, and ID2 in the colon tissue,
wherein elevated Prox-1 expression and elevated expression of the at least one
gene/protein in the colon tissue correlate with a pathological phenotype. In another
variation, the method further comprises measuring activation of the β-catenin/TCF
pathway in the colon tissue, wherein activation of the β-catenin/TCF pathway and
elevated Prox-1 expression in the colon tissue correlate with a pathological
phenotype. Activation of the β-catenin/TCF pathway can be measured by a variety of
indicators, including mutations in an APC gene; mutations in a β-catenin gene; and
nuclear localization of β-catenin.

The biological sample is any tissue or fluid sample obtained in any
way from a mammalian subject that includes cells from the large intestine. Biopsies
or other surgically removed specimens are preferred. Stool or feces may contain
sufficient colon tissue for some embodiments of the assay.

The assay may be performed on any mammalian subject, including
laboratory animals used in cancer research, livestock, and domestic pets. Humans are
most preferred.

Any available technique can be used for measuring Prox-1 expression,
including direct and indirect techniques. For example, in one variation, the measuring
comprises measuring Prox-1 protein in the biological sample. Preferred techniques
for measuring amounts or concentrations of Prox-1 protein in a sample are
immunological techniques that involve use of a polyclonal or monoclonal antibody
that specifically binds Prox-1, or use of a Prox-1-binding fragment of such an
antibody. For example, the measuring comprises contacting the colon tissue with a
Prox-1 antibody or antigen-binding fragment thereof. Quantification of the amount of
bound antibody (e.g., using a label or second, labeled antibody) provides a
measurement of Prox-1 protein expressed in the sample. Immunoassays such as
radioimmunoassay, immunoradiometric assay (labeled antibody), or an enzyme-linked
immunosorbent assay (ELISA) are contemplated.

In another variation, the measuring comprises measuring Prox-1
mRNA in the colon tissue. Elevated levels of Prox-1 mRNA in the sample are scored
as elevated Prox-1 expression. Any available assay for measuring specific
oligonucleotides is suitable. Preferred materials for such measurements are
oligonucleotide probes complementary to all or a portion of the Prox-1 mRNA
sequence. Probes of at least 14, and more preferably 18, nucleotides are preferred to
assure specificity. One technique for measuring Prox-1 mRNA comprises in situ
hybridization to measure Prox-1 mRNA in the colon sample. Other techniques
involve steps of isolating mRNA from the colon tissue and measuring Prox-1 mRNA
in the isolated mRNA, for example, by Northern hybridization procedures. In still
another variation, quantitative reverse transcriptase polymerase chain reaction (PCR),
real-time PCR, or other PCR techniques are employed to quantitatively amplify
Prox-1 mRNA (relative to control samples) to provide a quantitative measurement of
Prox-1 mRNA in the colon tissue.
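
By way of illustration only, a relative measurement of Prox-1 mRNA from
real-time PCR data can be sketched as follows. The sketch assumes the commonly
used comparative threshold-cycle (2^-ddCt) calculation, roughly 100% amplification
efficiency, and normalization to a reference gene; none of these choices, nor the
function and parameter names, are specified in this document.

```python
def relative_expression(ct_target_test, ct_ref_test,
                        ct_target_control, ct_ref_control):
    """Fold change of a target mRNA in a test sample versus a control sample.

    Assumption: comparative Ct (2^-ddCt) method with about 100% PCR efficiency.
    All arguments are threshold-cycle (Ct) values; "ref" denotes a reference
    (housekeeping) gene chosen by the user.
    """
    # Normalize the target Ct to the reference gene within each sample.
    delta_test = ct_target_test - ct_ref_test
    delta_control = ct_target_control - ct_ref_control
    # A lower Ct means more template, hence the negative exponent.
    return 2.0 ** -(delta_test - delta_control)


if __name__ == "__main__":
    # Hypothetical Ct values, for illustration only.
    fold = relative_expression(24.0, 18.0, 27.0, 18.0)
    print(f"Prox-1 mRNA fold change versus control: {fold:.1f}")
```

A fold change above 1 indicates more Prox-1 mRNA in the test sample than in the
control; what fold change is scored as "elevated" is left to the practitioner.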

In yet another embodiment, Prox-1 expression is measured indirectly 
by measuring a functional property of Prox-1, such as measuring Prox-1 binding to 
DNA or downstream Prox-1 transcription factor effects. 

The screening method further includes a comparing step whereby
Prox-1 expression in the colon tissue is compared to Prox-1 expression in healthy
colon tissue, wherein increased Prox-1 expression in the colon tissue correlates with a
pathological phenotype. As described herein, Prox-1 expression is elevated in a
statistically significant manner in pathological specimens studied, compared to
healthy colon tissue samples. In one variation, the comparison is performed by taking
simultaneous or sequential measurements of a test sample and a sample of colon
tissue that is known to be taken from healthy tissue. In another variation, data is
accumulated on the quantity of Prox-1 mRNA or protein in healthy tissues, and the
amount that is measured in the colon tissue from the biological sample is compared to
this predetermined amount. It will be appreciated that comparing Prox-1
measurements from a test sample to measurements from a cancerous or precancerous
condition can provide an equivalent indication of the presence or absence of the
pathological condition, wherein a test sample with Prox-1 expression comparable to
the elevated level observed in a cancer correlates with a pathological phenotype.

For measurement comparisons, a database of Prox-1 measurements
from colon tissues can be developed, preferably containing information about the
healthiness or disease of the tissue; the age, sex, and race/ethnicity of the donor; and
the location from which the sample was taken. With a database of samples, comparisons
can be analyzed using statistical analysis to determine the statistical significance of a
measurement's deviation from a mean, optionally selecting entries from the database
by selecting for the patient's age, sex, ethnicity, and other factors to best match the
patient (mammalian subject) being tested. Such statistical analysis permits
establishment of one or more "cutoff" values for the Prox-1 measurement that are
correlated with a likelihood of having, or developing, a cancerous condition.
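
One way such a database comparison could be carried out is sketched below,
purely as an illustration. The record fields, the ten-year age window, and the
z-score cutoff of 2.0 are assumptions made for the sketch and are not values
taught in this document; a real cutoff would be established from the accumulated
data.

```python
from dataclasses import dataclass
from statistics import mean, stdev


@dataclass
class Record:
    """One database entry (hypothetical fields, for illustration only)."""
    prox1_level: float  # Prox-1 mRNA or protein measurement, arbitrary units
    healthy: bool       # True if the tissue was scored as healthy
    age: int
    sex: str


def matched_healthy_levels(db, age, sex, age_window=10):
    """Select healthy entries matched to the patient by sex and age."""
    return [r.prox1_level for r in db
            if r.healthy and r.sex == sex and abs(r.age - age) <= age_window]


def z_score(measurement, reference_levels):
    """Deviation of a measurement from the reference mean, in standard deviations."""
    if len(reference_levels) < 2:
        raise ValueError("need at least two matched reference measurements")
    return (measurement - mean(reference_levels)) / stdev(reference_levels)


def exceeds_cutoff(measurement, reference_levels, z_cutoff=2.0):
    """True if the measurement lies above the assumed cutoff."""
    return z_score(measurement, reference_levels) > z_cutoff


if __name__ == "__main__":
    # Hypothetical database contents.
    db = [Record(1.0, True, 58, "F"), Record(2.0, True, 62, "F"),
          Record(3.0, True, 65, "F"), Record(9.0, False, 60, "F"),
          Record(2.5, True, 30, "M")]
    reference = matched_healthy_levels(db, age=60, sex="F")
    print(exceeds_cutoff(5.0, reference))
```

Selecting the reference entries before computing the deviation mirrors the
optional matching by age, sex, and other factors described above.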

If elevated Prox-1 is detected, then in a preferred embodiment, the
method further comprises a step of administering to a human subject identified as
having a pathological condition characterized by increased Prox-1 expression in colon
tissue a composition comprising a Prox-1 inhibitor.

In a related embodiment, the invention provides a method of inhibiting
the growth of colon cancer cells, such as colon carcinoma cells, colon adenoma cells,
or colon adenocarcinoma cells in a mammalian subject comprising a step of:

administering to the subject a composition comprising a molecule that
suppresses expression of Prox-1, thereby inhibiting the growth of the colon cancer
cells.

For reasons of cost, safety, and efficacy, it is becoming increasingly
preferred to attempt to identify patients most likely to benefit from a therapeutic
regimen before administering it. This is especially true with cancers where it is
known that not all patients respond the same to all therapies. Thus, in a preferred
variation of the method, steps are taken to identify patients most likely to benefit from
this regimen. For example, the method further comprises a step of identifying a
mammalian subject with a colon cancer characterized by increased Prox-1 expression.
The composition is administered to such a patient after the identifying step, because
cancers characterized by the elevated expression are expected to be the cancers most
likely to respond to the inhibitors. Exemplary cancers (neoplasms) in which Prox-1
elevation has been observed include colorectal adenomas and colorectal carcinomas,
as described below in greater detail.

The composition to be administered preferably includes, in addition to
the Prox-1 inhibitor, a pharmaceutically acceptable diluent, adjuvant, or carrier
medium. The composition optionally includes additional antineoplastic agents.

Administration of any Prox-1 inhibitor is contemplated for this
invention, either alone or in combination with other Prox-1
inhibitors or other antineoplastic agents. Exemplary inhibitor molecules include
antisense oligonucleotides that inhibit Prox-1 expression; micro-RNA that inhibits



WO 2005/014854 



PCT/EP2004/008819 



Prox-1 expression; small (short) interfering RNA (siRNA) that inhibit Prox-1
expression (e.g., siRNA that comprise at least one nucleotide sequence set forth in
SEQ ID NOS: 4, 5, 6, and 7); zinc finger proteins that inhibit Prox-1 expression;
polypeptides that act as dominant negative forms of Prox-1 protein, such as Prox-1
forms that have a disrupted DNA binding domain or transactivation domain(s);
polynucleotides that encode dominant-negative Prox-1 proteins; Prox-1 antibodies
and fragments thereof; polynucleotides that encode Prox-1 antibodies or encode
polypeptides that comprise Prox-1 binding domains; small molecules discovered and
designed through screening based on the teachings herein, and so on. U.S. Patent
Application Publication No. 2003/0224516 discloses exemplary molecules for
inhibiting Prox-1 expression and is incorporated herein by reference.

The inhibitor is preferably administered in an amount and in a regimen
that halts or inhibits neoplastic growth of the affected colorectal tissue. As another
benchmark, the tissue itself preferably reverts to a non-transformed, more healthy
looking phenotype. As described herein, one apparent benchmark of beneficial
administration is an increase in Notch-1 signaling. Thus, in one variation, the
composition is administered in an amount effective to suppress Prox-1 expression and
increase Notch-1 signaling.

Other indications of efficacy relate to modulation of prostaglandin
synthesis. Thus, in another variation, the composition is administered in an amount
effective to increase 15-PGDH activity or decrease prostaglandin D2 synthase
activity.

As described herein and in the literature, colorectal cancers also are often
characterized by increases in the β-catenin/TCF signaling pathway, relative to what is
observable in healthy colorectal tissue. Thus, in a preferred variation, in addition to
administering a Prox-1 inhibitor composition, the regimen further comprises
administering to the subject an inhibitor of the β-catenin/TCF signaling pathway.
(Optionally, the patient's diseased tissue is first pre-screened for elevated
expression/signaling of this pathway.) The categories of inhibitors described above
for Prox-1 are specifically contemplated for the β-catenin/TCF pathway as well. In
one variation, the inhibitor of the β-catenin/TCF signaling pathway is a dominant
negative form of TCF-4. The inhibitor optionally targets (inhibits) TCF-4, β-catenin,
or c-myc expression or activity.

In yet another variation, administration of the Prox-1 inhibitor is
combined with administration of a COX-2 inhibitor, such as any of the growing
class of non-steroidal anti-inflammatory agents.

In still another variation, administration of the Prox-1 inhibitor is
combined with administration of a Notch signaling pathway agonist, such as a Notch
ligand or an expression vector to cause expression of a Notch ligand. Exemplary Notch
ligands include Jagged1, Jagged2, Delta1, Delta3, Delta4, or Serrate.

Also contemplated is administration of a molecule that comprises an
inhibitor of DNA methyltransferases. Such inhibitors are themselves contemplated as
efficacious for inhibiting Prox-1 expression, and can be combined with any other
Prox-1 inhibitor described herein for combination therapy. An exemplary
methyltransferase inhibitor is 5-aza-2'-deoxycytidine.

In still another variation, the Prox-1 inhibitor composition is
administered in combination with any known antineoplastic agent that is used in
cancer therapy.

In still another variation, the Prox-1 inhibitor and/or COX-2 inhibitor
are combined (in a medicament or as a combination therapy) with an agent that
induces differentiation in colorectal cancer cell lines. Exemplary agents include
1,25-dihydroxyvitamin D3 and analogs thereof; butyrate; and retinoids.

With respect to any combination treatment or therapy regimens
described herein, the Prox-1 inhibitor composition can be administered
simultaneously with the other active agents, which may be in admixture with the
Prox-1 inhibitor, or may be in a separate composition. Each composition preferably
includes a pharmaceutically acceptable diluent, adjuvant, or carrier. When the agents
are separately administered, they may be administered in any order.

In still another embodiment, the invention includes a method of
inhibiting Prox-1 function in a mammalian subject having a disease characterized by
Prox-1 over-expression in cells, comprising the step of administering to said
mammalian subject a composition, said composition comprising a compound
effective to inhibit Prox-1 function in cells.

In still another variation, the invention includes the use of a Prox-1
inhibitor in the manufacture of a medicament for the treatment of a disease
characterized by Prox-1 over-expression in cells, especially cancerous or
precancerous cells of colorectal origin. The medicament optionally includes the
additional agents described above, either in admixture with the Prox-1 inhibitor or
separated, yet packaged together (preferably with instructions for treating the disease).

In yet another embodiment, the invention provides a method of
screening for Prox-1 modulators comprising the steps of: (a) contacting a test
molecule with Prox-1 protein, or a nucleic acid comprising a nucleotide sequence that
encodes Prox-1 protein, under conditions which permit the interaction of the test
molecule with the Prox-1 protein or nucleic acid; and (b) measuring the interaction
between the test molecule and Prox-1 protein or the nucleic acid, wherein a test
molecule that binds the Prox-1 protein or nucleic acid is identified as a Prox-1
modulator.

"Test molecule" refers to the molecule that is under evaluation for the 
ability to modulate (i.e., increase or decrease) the activity of Prox-1 protein. Most
commonly, a test molecule that is a Prox-1 modulator will interact directly with
Prox-1. However, the screens described herein can identify test molecules that modulate
Prox-1 protein activity indirectly, such as by affecting Prox-1 gene expression. The
screens work with essentially any test molecule, and the invention is not limited in
this manner. In preferred embodiments, the test molecule is a protein, a carbohydrate,
a lipid, or a nucleic acid. Molecules which regulate Prox-1 expression include nucleic
acids which are complementary to nucleic acids encoding a Prox-1 protein, or are
complementary to nucleic acid sequences which direct or control the expression of
Prox-1 protein, and which act as anti-sense regulators of expression. The test
molecule may be a member of a chemical library, such as libraries commonly
maintained in large pharmaceutical companies or libraries generated combinatorially.
In alternate embodiments, the test molecule interacts with Prox-1 by binding to the
Prox-1 DNA binding domain, thereby affecting Prox-1 activity.

With respect to the screening methods described herein, it may be
desirable to evaluate two or more test compounds together for their ability to increase
or decrease the Prox-1 protein activity or expression. The assays set forth herein can
be readily modified by adding such additional test compounds either simultaneously
with, subsequent to, or prior to the first test compound. In additional
embodiments, the measurement of the interaction of test molecules with Prox-1
is carried out using solution-phase assays or immunoassays. In other embodiments,
measurement of the interaction of test molecules with Prox-1 is carried out by
evaluating biological activity of Prox-1.

In a related embodiment, the invention provides a method of screening for
modulators of binding between a DNA and Prox-1 protein comprising steps of: (a)
contacting a DNA with a Prox-1 protein in the presence and in the absence of a
putative modulator compound; (b) detecting binding between the DNA and the Prox-1
protein in the presence and absence of the putative modulator compound; and (c)
identifying a modulator compound based on a decrease or increase in binding between
the DNA and the Prox-1 protein in the presence of the putative modulator compound,
as compared to binding in the absence of the putative modulator compound.
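
Step (c) above can be illustrated with a short sketch that compares a binding
signal measured with and without the putative modulator compound. The 20%
threshold and the labels returned are assumptions made for the illustration;
this document does not specify how large a change in binding must be.

```python
def classify_modulator(binding_without, binding_with, threshold=0.2):
    """Classify a compound from DNA/Prox-1 binding signals.

    binding_without: signal in the absence of the compound (must be > 0)
    binding_with:    signal in the presence of the compound
    threshold:       assumed minimum fractional change to call a modulator
    """
    if binding_without <= 0:
        raise ValueError("binding signal without compound must be positive")
    # Fractional change in binding caused by the compound.
    change = (binding_with - binding_without) / binding_without
    if change <= -threshold:
        return "inhibitor"   # binding decreased
    if change >= threshold:
        return "enhancer"    # binding increased
    return "none"            # change too small to call


if __name__ == "__main__":
    # Hypothetical signals in arbitrary units.
    print(classify_modulator(100.0, 40.0))
```

In practice the signals would come from replicate measurements, and the
comparison would typically include a test of statistical significance.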

In a related variation, molecules that modulate binding between DNA and
Prox-1 are formulated into a composition or a growth media for contacting a cell from
a colorectal cancer or colorectal cancer cell line, and a modulator that inhibits growth
of the cell is selected as a preferred modulator for development as a therapeutic.

In yet another related embodiment, the invention provides a method of
screening for modulators of binding between a DNA and Prox-1 protein comprising
steps of: (a) contacting a DNA with a Prox-1 protein in the presence and in the
absence of a putative modulator compound; (b) detecting binding between the DNA
and the Prox-1 protein in the presence and absence of the putative modulator
compound; and (c) identifying a modulator compound based on a decrease or increase
in differentiation in the presence of the putative modulator compound, as compared to
differentiation in the absence of the putative modulator compound.

In vivo screening also is contemplated, either in addition to or in place of in
vitro screening. The test compound preferably is formulated into a pharmaceutically
acceptable diluent, adjuvant, or carrier. In a preferred variation, this formulation is
administered to a mammal with pathological (e.g., cancerous) Prox-1 expressing
colon tissue, and the efficacy of the formulation at inhibiting disease progression is
monitored. For example, a method described above optionally further comprises
steps of formulating a composition comprising the selected Prox-1 modulator and a
pharmaceutically acceptable carrier; administering the composition to a mammalian
subject having a colorectal cancer; and monitoring the mammalian subject for growth,
metastasis, shrinkage, or disappearance of the colorectal cancer.

"Putative modulator compounds" are analogous to the "test molecules" 
described above in that they are alleged to have an effect on Prox-1 protein activity
and are being identified as such using the methods described herein. In certain
embodiments, detecting DNA binding to Prox-1 protein and identifying an increase or
decrease of DNA binding to Prox-1 protein employs immuno-based assays or various
other assays that measure biological activity. Likewise, embodied by the invention
are methods wherein identifying a modulator compound comprises the use of
proliferation and/or differentiation assays.

In still another variation of the invention, provided are short interfering RNA
(siRNA) molecules that down regulate expression of Prox-1 by RNA interference.
The siRNA molecule can be adapted for use to treat colorectal cancer and any other
indications that respond to the level of Prox-1. The siRNA molecule comprises a
sense region and an antisense region. The antisense region comprises sequence
complementary to an RNA sequence encoding Prox-1, or a fragment thereof, and the
sense region comprises sequence complementary to the antisense region. In additional
embodiments, the siRNA molecule can comprise two nucleic acid fragments, wherein
one fragment comprises the sense region and the second fragment comprises the
antisense region of said siRNA molecule.

In one embodiment, a siRNA molecule of the invention can comprise any
contiguous Prox-1 sequence. Preferably, the siRNA constructs are between 18 and
100 nucleotides in length. More preferably, the siRNA constructs are 21 nucleotides
in length. In still another embodiment, the sense region of a siRNA molecule of the
invention comprises a 3'-terminal overhang and the antisense region comprises a
3'-terminal overhang. The 3'-terminal overhangs each are preferably from 1 to 5
nucleotides. More preferably, the 3'-terminal overhangs are 2 nucleotides. In a
preferred embodiment, the antisense region of the 3'-terminal nucleotide overhang is
complementary to RNA encoding Prox-1.
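
The strand geometry described above (21-nucleotide strands, each with a
2-nucleotide 3'-terminal overhang, and an antisense strand fully complementary
to the target RNA) can be sketched as follows. This is a minimal illustration
under stated assumptions: the input sequence in the example is arbitrary and is
not a Prox-1 sequence or any SEQ ID NO of this document, the function names are
hypothetical, and taking the sense overhang from the target sequence is one
design choice among several.

```python
# Watson-Crick complement table for RNA.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna):
    """Reverse complement of an RNA sequence written 5' to 3'."""
    return rna.translate(COMPLEMENT)[::-1]


def design_sirna(mrna, start, duplex_len=19, overhang=2):
    """Return (sense, antisense) strands, each written 5' to 3'.

    A window of duplex_len + 2 * overhang bases is taken from the target.
    The antisense strand is complementary to the first duplex_len + overhang
    bases of the window, so its 3' overhang also pairs with the target.
    The sense strand is the window minus its first `overhang` bases, so its
    3' overhang is copied from the target sequence (an assumption here).
    """
    window_len = duplex_len + 2 * overhang
    window = mrna[start:start + window_len].upper().replace("T", "U")
    if len(window) != window_len or set(window) - set("ACGU"):
        raise ValueError("target window is too short or has invalid bases")
    strand_len = duplex_len + overhang
    antisense = reverse_complement(window[:strand_len])
    sense = window[overhang:]
    return sense, antisense


if __name__ == "__main__":
    # Arbitrary illustrative sequence, not taken from Prox-1.
    target = "AUGCAUGCAUGCAUGCAUGCAUGCAUG"
    sense, antisense = design_sirna(target, start=2)
    print("sense     5'-" + sense + "-3'")
    print("antisense 5'-" + antisense + "-3'")
```

With the default parameters each strand is 21 nucleotides long, and the first 19
nucleotides of the two strands form the base-paired duplex.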

With respect to the antisense region of the siRNA constructs, the antisense
region of Prox-1 siRNA constructs can comprise a sequence complementary to a
sequence having any of SEQ ID NOs. 4 and 6. Further, the antisense region of Prox-1
siRNA constructs can comprise a sequence having any of SEQ ID NOs. 5 and 7.

In yet an additional embodiment of the invention, compounds, particularly
antisense oligonucleotides, which are targeted to a nucleic acid encoding Prox-1, and
which modulate the expression of Prox-1 are provided. The antisense
oligonucleotides of the invention are preferably complementary to (at least a segment
of) the genomic Prox-1 sequence set forth as SEQ ID NO:1. mRNA splice sites, i.e.,
intron-exon junctions, may be preferred target regions. Accordingly, in another
embodiment, the antisense oligonucleotides of the invention comprise a region
complementary to a promoter or other control region, an exon, an intron, or an
exon-intron boundary. Also embodied by the present invention are antisense
oligonucleotides that are complementary to a region within 20-200 bases of an
exon-intron splice junction. As detailed herein, pharmaceutical compositions comprising
antisense oligonucleotides are also provided.

The foregoing paragraphs are not intended to define every aspect of the
invention, and additional aspects are described in other sections, such as the Detailed
Description. The entire document is intended to be related as a unified disclosure, and
it should be understood that all combinations of features described herein are
contemplated, even if the combination of features is not found together in the same
sentence, or paragraph, or section of this document. Where protein therapy is
described, embodiments involving polynucleotide therapy (using polynucleotides that
encode the protein) are specifically contemplated, and the reverse also is true.

In addition to the foregoing, the invention includes, as an additional
aspect, all embodiments of the invention narrower in scope in any way than the
variations defined by specific paragraphs above. For example, certain aspects of the
invention are described as a genus, and it should be understood that every
member of a genus is, individually, an aspect of the invention. Although the
applicant(s) invented the full scope of the invention described herein, the applicants
do not intend to claim subject matter described in the prior art work of others.
Therefore, in the event that statutory prior art within the scope of a claim is brought to
the attention of the applicants by a Patent Office or other entity or individual, the
applicant(s) reserve the right to exercise amendment rights under applicable patent
laws to redefine the subject matter of such a claim to specifically exclude such
statutory prior art or obvious variations of statutory prior art from the scope of such a
claim. Variations of the invention defined by such amended claims also are intended
as aspects of the invention.



BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1A, 1B, and 1C depict the elevated Prox-1 mRNA levels in
colorectal tumors. A cancer RNA profiling array was hybridized to probes for Prox-1
(Fig. 1A) and the lymphatic endothelial marker LYVE-1 (Fig. 1B). Fig. 1C illustrates
the quantification of the dot blot in Fig. 1A, the asterisk indicating tumor samples in
which Prox-1 expression is significantly different from that of the normal tissue
(P<0.005).

Figures 2A-2I depict Prox-1 expression patterns in colon cancer and
normal colonic epithelium. Frozen sections of colon adenomas (Fig. 2A-C) or
adenocarcinomas (Fig. 2D-F) and the corresponding normal tissues (Fig. 2H-I) were
stained for Prox-1. Fig. 2C and Fig. 2I show high power magnification of adenoma
and normal colon sections.

Figure 3 depicts the efficacy of Prox-1 suppression for inhibiting
SW480R cell growth in soft agar. SW480R cells were transfected with GFP siRNA,
Prox-1 siRNA A16 or Prox-1 siRNA A25 or left untreated, and seeded in soft agar in
triplicate. The number of colonies was scored after two weeks of growth.

DETAILED DESCRIPTION

Demonstrated herein for the first time is the importance of Prox-1 in
cancer. The Prox-1 gene and protein are overexpressed in colorectal cancers, as
compared to healthy colon tissue and other cancer tissues. Prox-1 was overexpressed
in 68% of colorectal carcinomas and in 80% of premalignant lesions that were
examined, indicating that Prox-1 is important for tumorigenesis, and therefore a
useful marker for screening and a useful target for intervention. In normal colonic
epithelium, Prox-1 expression was restricted to two cell types, neuroendocrine cells
and non-proliferating cells at the very base of the colonic crypts, a location that
corresponds to the stem cell compartment. Contemplated and provided for in the
present invention are polynucleotides and polypeptides for screening and diagnosis of
colorectal cancer and/or premalignancies.

Intervention to suppress Prox-1 expression in colorectal cells resulted
in increased activation of Notch signal transduction. Specifically, ablation of Prox-1
resulted in cell growth arrest and increased expression of epithelial markers. This was
accompanied by an upregulation of the cell cycle inhibitor p21Cip1, which has been
shown to be important for the differentiation of intestinal epithelia (Quaroni et al.,
Am. J. Physiol. Cell Physiol. 279: C1045-57, 2000; Yang et al., Cancer Res. 61: 565-9,
2001), and by an increased expression of components of the Notch signaling
pathway. Unexpectedly, this phenotype persisted for up to two weeks after transient
transfection with Prox-1 siRNAs, demonstrating profound changes in the
transcriptional program induced in the absence of Prox-1. Without intending to be
limited to a particular theory or mechanism, Prox-1 may be involved in the
maintenance of an undifferentiated state of colonic intestinal stem cells, and
overexpression of Prox-1 in cancer cells and resulting inhibition of the Notch
signaling pathway may lead to the de-differentiation frequently observed upon
malignant transformation. The suppression of Prox-1 expression also negatively
regulates prostaglandin activity in the tumor cell lines studied. It is, therefore,
contemplated that suppression of Prox-1 or activation of Notch signaling in tumor
cells can provide a differentiation therapy for colon carcinoma. The present
invention, more specifically, provides compositions and methods for suppressing
Prox-1 expression.

A. Inhibitory Nucleic Acid Constructs for the Suppression of Prox-1
Expression

As discussed herein, Prox-1 is overexpressed in colorectal cancer cells
and suppression of Prox-1 expression results in increased Notch signal transduction
and modified expression of enzymes of the prostaglandin biosynthetic pathway. These
data provide a rationale for disrupting the expression or activity of Prox-1 as a method
of alleviating the symptoms of and/or inhibiting the growth or metastasis of colon
cancer. Such disruption is achieved using any materials or methods available to
inhibit Prox-1 mRNA or protein expression, Prox-1 binding, or any Prox-1
activity. The present section discusses nucleic acid-based methods of disrupting the
expression of Prox-1. Polynucleotide products which are useful in this endeavor
include antisense polynucleotides, ribozymes, small interfering RNAs, natural or
designed microRNAs, triple helix polynucleotides, and novel transcription factors that
modulate the expression of Prox-1 protein.

Techniques for making and delivering antisense polynucleotides and
ribozymes are well known to those in the art and have been extensively described in
scientific, patent, and trade literature (PCT Publication No. WO 00/32765; Lima et al.,
J Biol Chem 272:626-38, 1997; Kurreck et al., Nucleic Acids Res. 30:1911-8, 2002;
Crooke and B. Lebleu, eds., Antisense Research and Applications (1993) CRC Press;
and Antisense RNA and DNA (1988) D. A. Melton, Ed., Cold Spring Harbor
Laboratory, Cold Spring Harbor, N.Y.). Anti-sense RNA and DNA molecules act to
directly block the translation of mRNA by binding to targeted mRNA and preventing
protein translation. An example of an antisense polynucleotide is an
oligodeoxyribonucleotide derived from the translation initiation site, e.g., between the -10
and +10 regions of the relevant nucleotide sequence. Antisense oligonucleotides of 8-200
nucleotides in length that include at least a portion of this region of the Prox-1
cDNA or genomic sequences set forth as SEQ ID NOs: 1 and 2 (or are
complementary thereto) are preferred Prox-1 inhibitors of the invention.

Antisense polynucleotides are typically generated within the cell by
expression from antisense constructs that contain the antisense nucleic acid strand as
the transcribed strand. Antisense methodology takes advantage of the fact that
nucleic acids tend to pair with "complementary" sequences. By complementary, it is
meant that polynucleotides are those which are capable of base-pairing according to
the standard Watson-Crick complementarity rules. That is, the larger purines will
base pair with the smaller pyrimidines to form combinations of guanine paired with
cytosine (G:C) and adenine paired with either thymine (A:T) in the case of DNA, or
adenine paired with uracil (A:U) in the case of RNA. Inclusion of less common bases
such as inosine, 5-methylcytosine, 6-methyladenine, hypoxanthine and others in
hybridizing sequences does not interfere with pairing.
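
The Watson-Crick pairing rules described above can be illustrated with a short sketch that derives the antisense strand for a given target. This is only an illustration: the function name `antisense` and the 10-nucleotide example sequence are hypothetical and are not taken from SEQ ID NO: 1 or 2, and less common bases such as inosine are not handled.

```python
# Watson-Crick partners for the DNA and RNA alphabets (standard bases only).
DNA_PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}
RNA_PAIRS = {"A": "U", "U": "A", "G": "C", "C": "G"}


def antisense(target: str, rna: bool = False) -> str:
    """Return the strand that base-pairs with `target`, written 5'->3'.

    The two strands of a duplex are antiparallel, so the target is read
    in reverse while each base is swapped for its pairing partner.
    """
    pairs = RNA_PAIRS if rna else DNA_PAIRS
    return "".join(pairs[base] for base in reversed(target.upper()))


# Hypothetical stretch around a start codon (illustrative only).
example_target = "GCCATGGCTA"
example_antisense = antisense(example_target)
```

An RNA target is handled by passing `rna=True`, in which case adenine is paired with uracil rather than thymine.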

Targeting double-stranded (ds) DNA with polynucleotides leads to
triple-helix formation; targeting RNA will lead to double-helix formation. Antisense
polynucleotides, when introduced into a target cell, specifically bind to their target
polynucleotide and interfere with transcription, RNA processing, transport, translation
and/or stability. Antisense RNA constructs, or DNA encoding such antisense RNAs,
may be employed to inhibit gene transcription or translation or both within a host cell,
either in vitro or in vivo, such as within a host animal, including a human subject.

Antisense constructs may be designed to bind to the promoter and
other control regions, exons, introns or even exon-intron boundaries of a gene.
Highly effective antisense constructs include regions complementary to intron/exon
splice junctions. Thus, a preferred embodiment includes an antisense construct with
complementarity to regions within 50-200 bases of an intron-exon splice junction. It
has been observed that some exon sequences can be included in the construct without
seriously affecting the target selectivity thereof. The amount of exonic material
included will vary depending on the particular exon and intron sequences used. One
can readily test whether too much exon DNA is included simply by testing the
constructs in vitro to determine whether normal cellular function is affected or
whether the expression of related genes having complementary sequences is affected.

For purposes of making antisense oligonucleotides, polynucleotide
sequences that are substantially complementary over their entire length and have zero
or very few base mismatches are preferred. For example, sequences of fifteen bases
in length preferably have complementary nucleotides at thirteen or fourteen or fifteen
positions. Naturally, sequences which are completely complementary will be
sequences which are complementary throughout their entire length and have
no base mismatches. Other sequences with lower degrees of homology also are
contemplated. For example, an antisense construct which has limited regions of high
homology, but also contains a non-homologous region (e.g., ribozymes) could be
designed. These molecules, though having less than 50% homology, would bind to
target sequences under appropriate conditions.
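
The preference stated above (thirteen, fourteen or fifteen complementary positions out of fifteen) can be checked with a small sketch. The function names and the `max_mismatches` parameter are hypothetical, and the 15-base site used in the usage note is invented for illustration; only standard DNA bases are assumed.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def matched_positions(oligo: str, target_site: str) -> int:
    """Count positions at which `oligo` pairs with `target_site`.

    Both sequences are written 5'->3'. Because the oligonucleotide binds
    antiparallel to its target, it is compared against the reversed site.
    """
    if len(oligo) != len(target_site):
        raise ValueError("oligo and target site must be the same length")
    paired = zip(oligo.upper(), reversed(target_site.upper()))
    return sum(COMPLEMENT.get(o) == t for o, t in paired)


def meets_preference(oligo: str, target_site: str, max_mismatches: int = 2) -> bool:
    """True if the oligo has no more than `max_mismatches` mismatched bases."""
    mismatches = len(oligo) - matched_positions(oligo, target_site)
    return mismatches <= max_mismatches
```

For a fifteen-base site, `meets_preference` with the default of two allowed mismatches corresponds to the thirteen-of-fifteen threshold given in the text.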

Methods for designing and optimizing antisense nucleotides are
described in Lima et al. (J Biol Chem 272:626-38, 1997) and Kurreck et al. (Nucleic
Acids Res. 30:1911-8, 2002). Additionally, commercial software and online
resources are available to optimize antisense sequence selection and also to compare
selected sequences to known genomic sequences to help ensure uniqueness/specificity
for a chosen gene. (See, e.g., world wide web at sfold.wadsworth.org/index.pl.) Such
uniqueness can be further confirmed by hybridization analyses. Antisense nucleic
acids are introduced into cells (e.g., by a viral vector or colloidal dispersion system
such as a liposome).

The genomic contig of chromosome 1 (where Prox-1 is located),
cDNA for Prox-1, and protein sequences for Prox-1 (SEQ ID NOs: 1, 2, and 3,
respectively) are published and disclosed as Genbank Accession Numbers
NT_021877, NM_002763, and NM_002763, respectively. The Genbank Database is
accessible on the world wide web at ncbi.nlm.nih.gov. Related Prox-1 protein and/or
nucleic acid sequences from other sources may be identified using probes directed at
these sequences. Such additional sequences may be useful in certain aspects of the
present invention. Although antisense sequences may be full length genomic or
cDNA copies, they also may be shorter fragments or oligonucleotides, e.g.,
polynucleotides of 100 or fewer bases. Although shorter oligomers (8-20) are easier to
make and more easily permeable in vivo, other factors also are involved in
determining the specificity of base pairing. For example, the binding affinity and
sequence specificity of an oligonucleotide to its complementary target increases with
increasing length. It is contemplated that oligonucleotides of 8, 9, 10, 11, 12, 13, 14,
15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, or more base pairs will be used.

Ribozymes are enzymatic RNA molecules capable of catalyzing the
specific cleavage of RNA. The cleavage event renders the mRNA unstable and
prevents protein expression. The mechanism of ribozyme action involves sequence
specific interaction of the ribozyme molecule to complementary target RNA, followed
by an endonucleolytic cleavage. Within the scope of the invention are engineered
hammerhead, for which the substrate sequence requirements are minimal, or other
motif ribozyme molecules that specifically and efficiently catalyze endonucleolytic
cleavage of RNA sequences encoding protein complex components. Design of the
hammerhead ribozyme and the therapeutic uses of ribozymes are disclosed in Usman
et al., Current Opin. Struct. Biol. (1996) 6:527-533. Ribozymes can also be prepared
and used as described in Long et al., FASEB J. (1993) 7:25; Symons, Ann. Rev.
Biochem. (1992) 61:641; Perrotta et al., Biochem. (1992) 31:16-17; Ojwang et al.,
Proc. Natl. Acad. Sci. (USA) (1992) 89:10802-10806; and U.S. Pat. No. 5,254,678.
Methods of cleaving RNA using ribozymes are described in U.S. Pat. No. 5,116,742;
and methods for increasing the specificity of ribozymes are described in U.S. Pat. No.
5,225,337 and Koizumi et al., Nucleic Acid Res. (1989) 17:7059-7071. Preparation
and use of ribozyme fragments in a hairpin structure are described by Chowrira and
Burke, Nucleic Acids Res. (1992) 20:2835. Ribozymes can also be made by rolling
transcription (Daubendiek and Kool, Nat. Biotechnol. (1997) 15(3):273-277).

The full-length gene need not be known in order to design and use
specific inhibitory ribozymes. Specific ribozyme cleavage sites within any potential
RNA target are initially identified by scanning the target molecule for ribozyme
cleavage sites which include the following sequences: GUA, GUU, and GUC. Once
identified, short RNA sequences of between 15 and 20 ribonucleotides corresponding
to the region of the target gene containing the cleavage site may be evaluated for
predicted structural features, such as secondary structure, that may render the
oligonucleotide sequence unsuitable. The suitability of candidate targets may also be
evaluated by testing their accessibility to hybridization with complementary
oligonucleotides, using ribonuclease protection assays (see PCT WO 93/23569;
and U.S. Pat. No. 5,093,246, incorporated herein by reference). Using the nucleic
acid sequences disclosed herein and methods known in the art, ribozymes can be
designed to specifically bind and cut the corresponding mRNA species. Ribozymes,
therefore, provide a means to inhibit the expression of Prox-1.
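
The initial scanning step described above can be sketched as follows. The function name `find_cleavage_sites`, the default window of 17 ribonucleotides (chosen here only because it lies within the 15 to 20 range mentioned in the text), and the test sequence in the usage note are all assumptions made for illustration; evaluation of secondary structure and accessibility is not attempted.

```python
import re


def find_cleavage_sites(rna: str, window: int = 17):
    """Return (position, triplet, surrounding sequence) for each candidate site.

    Candidate sites are the triplets GUA, GUU and GUC. `position` is the
    0-based index of the G. The surrounding sequence is a window of up to
    `window` nucleotides centred on the triplet and clipped at the ends
    of the molecule.
    """
    rna = rna.upper().replace("T", "U")  # accept cDNA input as well
    flank = (window - 3) // 2
    sites = []
    # A lookahead is used so that overlapping candidates are not skipped.
    for match in re.finditer(r"(?=(GU[AUC]))", rna):
        pos = match.start()
        start = max(0, pos - flank)
        end = min(len(rna), pos + 3 + flank)
        sites.append((pos, match.group(1), rna[start:end]))
    return sites
```

Applied to the invented sequence `AAGUCAAAGUGAAGUUA`, the sketch reports a GUC at position 2 and a GUU at position 13, and skips the GUG at position 8.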

Alternatively, endogenous gene expression can be reduced by
inactivating or "knocking out" the gene or its promoter using targeted homologous
recombination. (E.g., see Smithies et al., 1985, Nature 317:230-234; Thomas &
Capecchi, 1987, Cell 51:503-512; Thompson et al., 1989 Cell 5:313-321). For
example, a mutant, non-functional gene (or a completely unrelated DNA sequence)
flanked by DNA homologous to the endogenous gene (either the coding regions or
regulatory regions of the gene) can be used to transfect cells that express that gene in
vivo. Insertion of the DNA construct, via targeted homologous recombination, results
in inactivation of the gene.

Gene expression can also be reduced by targeting deoxyribonucleotide
sequences complementary to the regulatory region of the target gene (i.e., the gene
promoter and/or enhancers) to form triple helical structures that prevent transcription
of the gene in target cells in the body. (See generally, Helene, C., 1991, Anticancer
Drug Des., 6(6):569-84; Helene, C., et al., 1992, Ann. N.Y. Acad. Sci., 660:27-36;
and Maher, L. J., 1992, Bioessays 14(12):807-15). Nucleic acid molecules used in
triple helix formation for the inhibition of transcription are generally single stranded
deoxyribonucleotides. The base composition must be designed to promote triple helix
formation via Hoogsteen base pairing rules, which generally require sizeable stretches
of either purines or pyrimidines to be present on one strand of a duplex. Nucleotide
sequences may be pyrimidine-based, which will result in TAT and CGC+ triplets
across the three associated strands of the resulting triple helix. The pyrimidine-rich
molecules provide base complementarity to a purine-rich region of a single strand of
the duplex in a parallel orientation to that strand. In addition, nucleic acid molecules
may be chosen that are purine-rich, for example, containing a stretch of G residues.
These molecules will form a triple helix with a DNA duplex that is rich in GC pairs,
in which the majority of the purine residues are located on a single strand of the
targeted duplex, resulting in GGC triplets across the three strands in the triplex.
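
Because Hoogsteen pairing generally requires a sizeable stretch of purines on one strand of the duplex, a first screening step for candidate triple-helix target sites might simply locate uninterrupted purine runs in a regulatory sequence. The sketch below does only that. The function name and the default minimum length of 12 are hypothetical choices for illustration (the text does not specify a length), and the example sequence in the usage note is invented.

```python
import re


def purine_tracts(strand: str, min_length: int = 12):
    """Return (start, tract) for every uninterrupted run of purines (A or G)
    that is at least `min_length` bases long on the given DNA strand."""
    pattern = re.compile(r"[AG]{%d,}" % min_length)
    return [(m.start(), m.group()) for m in pattern.finditer(strand.upper())]
```

For example, `purine_tracts("CTAGGAGGAAGGAGGATC", 10)` reports a single 14-base purine run starting at index 2. The complementary strand of the duplex would be scanned separately, since a purine run on either strand can serve as a target.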

Alternatively, the potential sequences that can be targeted for triple
helix formation may be increased by creating a so called "switchback" nucleic acid
molecule. Switchback molecules are synthesized in an alternating 5'-3', 3'-5' manner,
such that they base pair with first one strand of a duplex and then the other,
eliminating the necessity for a sizeable stretch of either purines or pyrimidines to be
present on one strand of a duplex.

Another technique for inhibiting the expression of a gene involves the
use of RNA for induction of RNA interference (RNAi), using double stranded
RNA (dsRNA) (Fire et al., Nature 391: 806-811, 1998) or small interfering RNA (siRNA)
sequences (Elbashir et al., Nature 411: 494-498, 2001; Yu et al., Proc Natl Acad
Sci USA 99: 6047-52, 2002). "RNAi" is the process by which dsRNA induces
homology-dependent degradation of complementary mRNA. The presence of dsRNA
in cells triggers the RNAi response through a mechanism that has yet to be fully
characterized. In one embodiment, a synthetic antisense nucleic acid molecule is
hybridized by complementary base pairing with a "sense" ribonucleic acid to form a
double stranded RNA. The presence of long dsRNAs in cells stimulates the activity
of a ribonuclease III enzyme. The dsRNA antisense and sense nucleic acid molecules
provided correspond to at least about 20, 25, 50, 100, 250 or 500 nucleotides
of a Prox-1 coding strand, to an entire Prox-1 coding strand, or to only a portion
thereof. In an alternative embodiment, the siRNAs are 30 nucleotides or less in
length, and more preferably 21 to 23 nucleotides, with characteristic 2- to 3-nucleotide
3'-overhanging ends, which are generated by ribonuclease III cleavage from longer
dsRNAs. (See, e.g., Tuschl T.,
Nat Biotechnol. 20:446-48, 2002). At notably higher concentrations, single stranded
21 nucleotide RNA molecules have also been shown to function as siRNAs (i.e., enter
the RNAi pathway and specifically target mRNA for degradation in mammalian cells)
(Martinez et al., Cell 110, 563-574, 2002). Cleavage of the target RNA takes place in
the middle of the region complementary to the antisense strand of the siRNA duplex
(Elbashir et al., 2001, Genes Dev., 15, 188).

Intracellular transcription of small RNA molecules can be achieved by
cloning the siRNA templates into RNA polymerase III (Pol III) transcription units,
which normally encode the small nuclear RNA (snRNA) U6 or the human RNAse P
RNA H1. Two approaches can be used to express siRNAs: in one embodiment, sense
and antisense strands constituting the siRNA duplex are transcribed using constructs
with individual promoters (Lee et al., Nat. Biotechnol. 20, 500-505, 2002); in an
alternative embodiment, siRNAs are expressed as stem-loop hairpin RNA structures
that give rise to siRNAs after intracellular processing (Brummelkamp et al., Science
296:550-553, 2002, herein incorporated by reference). Alternatively, a stem-loop
hairpin can be expressed within an unrelated Pol II-transcribed mRNA transcript. A
stem-loop hairpin designed to contain the siRNA sequence also contains conserved
microRNA sequences within the loop and stem regions, thus resembling a natural


WO 2005/014854 



-21- 



PCT/EP2004/008819 



precursor mRNA structure. Subsequently, the precursor can be processed by the
cellular RNAi components to yield mature, functional siRNA/miRNA. (See,
generally, Zeng et al., Mol Cell 9, 1327-1333 (2002); Hutvagner et al., Science 297,
2056-2060 (2002); Kawasaki et al., Nature 423, 838-842 (2003)).

RNAi has been studied in a variety of systems. Work in Drosophila
embryonic lysates (Elbashir et al., 2001, EMBO J., 20, 6877) has revealed certain
requirements for siRNA length, structure, chemical composition, and sequence that
are essential to mediate efficient RNAi activity. Twenty-one nucleotide siRNA
duplexes are most active when containing two-nucleotide 3'-overhangs. Replacing the
3'-overhanging segments of a 21-mer siRNA duplex having 2-nucleotide 3' overhangs
with deoxyribonucleotides has no adverse effect on RNAi activity, while replacing
up to 4 nucleotides on each end of the siRNA with deoxyribonucleotides may be well
tolerated. Complete substitution with deoxyribonucleotides results in no RNAi
activity (Elbashir et al., 2001, EMBO J., 20, 6877).
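
The duplex geometry described above (21-nucleotide strands pairing over 19 positions, each with a two-nucleotide 3' overhang that may be made of deoxyribonucleotides) can be sketched as follows. This is a minimal illustration under stated assumptions: the function name `design_sirna` is hypothetical, the default `dTdT` overhang is one commonly used deoxyribonucleotide choice rather than something specified in this document, and the 19-nucleotide core in the usage note is invented and is not a Prox-1 sequence.

```python
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def design_sirna(target_core: str, overhang: str = "dTdT") -> dict:
    """Build sense and antisense strands for a 19-nt target core.

    Each strand is the 19-nt paired region followed by a 2-nt 3' overhang,
    giving 21-nt strands. Strands are written 5'->3'.
    """
    core = target_core.upper().replace("T", "U")  # accept cDNA input
    if len(core) != 19 or set(core) - set(RNA_COMPLEMENT):
        raise ValueError("target core must be 19 nucleotides of A, C, G, U/T")
    # The antisense (guide) strand is the reverse complement of the target.
    antisense_core = "".join(RNA_COMPLEMENT[b] for b in reversed(core))
    return {"sense": core + overhang, "antisense": antisense_core + overhang}
```

For the invented core `GCAUGCAUGCAUGCAUGCA`, the sketch returns the sense strand `GCAUGCAUGCAUGCAUGCAdTdT` and the antisense strand `UGCAUGCAUGCAUGCAUGCdTdT`.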

Furthermore, complete substitution of one or both siRNA strands with
2'-deoxy (2'-H) or 2'-O-methyl nucleotides results in no RNAi activity, whereas
substitution of the 3'-terminal siRNA overhang nucleotides with deoxy nucleotides
(2'-H) is tolerated. Single mismatch sequences in the center of the siRNA duplex may
abolish RNAi activity. In addition, studies indicate that the position of the cleavage
site in the target RNA is defined by the 5'-end of the siRNA guide sequence rather
than the 3'-end (Elbashir et al., 2001, EMBO J., 20, 6877). Other studies indicate that
a 5'-phosphate on the target-complementary strand of a siRNA duplex is required for
siRNA activity and that ATP is utilized to maintain the 5'-phosphate moiety on the
siRNA (Nykanen et al., 2001, Cell, 107, 309).

The dsRNA/siRNA is most commonly administered by annealing
sense and antisense RNA strands in vitro before delivery to the organism. In an
alternate embodiment, RNAi may be carried out by administering sense and antisense
nucleic acids of the invention in the same solution without annealing prior to
administration, and may even be performed by administering the nucleic acids in
separate vehicles within a very close timeframe.

Genetic control can also be achieved through the design of novel
transcription factors for modulating expression of the gene of interest in native cells
and animals. For example, the Cys2-His2 zinc finger proteins, which bind DNA via
their zinc finger domains, have been shown to be amenable to structural changes that
lead to the recognition of different target sequences. These artificial zinc finger
proteins recognize specific target sites with high affinity and low dissociation
constants, and are able to act as gene switches to modulate gene expression.
Knowledge of the particular target sequence of the present invention facilitates the
engineering of zinc finger proteins specific for the target sequence using known
methods such as a combination of structure-based modeling and screening of phage
display libraries (Segal et al., (1999) Proc Natl Acad Sci USA 96:2758-2763; Liu et
al., (1997) Proc Natl Acad Sci USA 94:5525-30; Greisman and Pabo (1997) Science
275:657-61; Choo et al., (1997) J Mol Biol 273:525-32). Each zinc finger domain
usually recognizes three or more base pairs. Since a recognition sequence of 18 base
pairs is generally sufficient in length to render it unique in any known genome, a zinc
finger protein consisting of 6 tandem repeats of zinc fingers would be expected to
ensure specificity for a particular sequence (Segal et al., (1999) Proc Natl Acad Sci
USA 96:2758-2763). The artificial zinc finger repeats, designed based on target
sequences, are fused to activation or repression domains to promote or suppress gene
expression (Liu et al., (1997) Proc Natl Acad Sci USA 94:5525-30). Alternatively,
the zinc finger domains can be fused to the TATA box-binding factor (TBP) with
varying lengths of linker region between the zinc finger peptide and the TBP to create
either transcriptional activators or repressors (Kim et al., (1997) Proc Natl Acad Sci
USA 94:3616-3620). Such proteins, and polynucleotides that encode them, have
utility for modulating expression in vivo in native cells, animals and humans.
The novel transcription factor can be delivered to the target cells by transfecting
constructs that express the transcription factor (gene therapy), or by introducing the
protein. Engineered zinc finger proteins can also be designed to bind RNA sequences
for use in therapeutics as alternatives to antisense or catalytic RNA methods (McColl
et al., (1999) Proc Natl Acad Sci USA 96:9521-6; Wu et al., (1995) Proc Natl Acad
Sci USA 92:344-348).
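
The arithmetic behind the 18 base pair figure can be made explicit with a short sketch. It assumes a simple model in which genome sequence is treated as random with equal base frequencies, and it uses an approximate human genome size of 3.2 x 10^9 base pairs; both are simplifying assumptions introduced here, as are the function names.

```python
import math


def fingers_needed(site_length_bp: int, bp_per_finger: int = 3) -> int:
    """Number of tandem zinc fingers needed to span a recognition site,
    assuming each finger reads `bp_per_finger` base pairs."""
    return math.ceil(site_length_bp / bp_per_finger)


def expected_random_hits(site_length_bp: int, genome_size_bp: int) -> float:
    """Expected number of chance occurrences of one specific site in a
    random-sequence genome, counting both strands."""
    return 2 * genome_size_bp / 4 ** site_length_bp


# Approximate human genome size; an assumption used only for illustration.
HUMAN_GENOME_BP = 3_200_000_000

fingers = fingers_needed(18)                                # six fingers
possible_sites = 4 ** 18                                    # about 6.9e10
chance_hits = expected_random_hits(18, HUMAN_GENOME_BP)     # well below 1
```

Under this model there are far more possible 18 base pair sequences than positions in the genome, so a given 18 base pair site is expected to occur by chance less than once, which is consistent with the six-finger design described in the text. Real genomes contain repeats and biased base composition, so uniqueness of any chosen site would still need to be confirmed against the actual sequence.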

Inactivation of Prox-1 function can also be accomplished using an
overexpressed dominant negative form of Prox-1. As used herein a "dominant
negative protein" is a mutant form of a protein which has the property of inhibiting
the function of the endogenous, wild type form of the protein which corresponds to
the mutant protein. Typically, dominant negative proteins have amino acid
substitutions or are truncated forms of the wild type protein. The mutation may be in
a substrate-binding domain (or DNA binding domain), a catalytic domain, or a
cellular localization domain. For instance, a dominant negative form of Prox-1 may
include a mutant truncated with respect to the DNA binding domain or transactivation
domain. Disruption of the DNA binding domain entails truncation of the protein to
exclude amino acids 572-634 of SEQ ID NO: 3, based on homology to Prospero
(Drosophila). Disruption of the transactivation domain entails the deletion of amino
acids 635-737. Other dominant negatives may include truncated forms of Prox-1
lacking the last 60 amino acids or the first 575 amino acids. Preferably, the mutant
polypeptide will be overproduced. Point mutations can be made that have such an
effect. In addition, fusion of different polypeptides of various lengths to the terminus
of a protein can yield dominant negative mutants. General strategies for making
dominant negative mutants are described in Herskowitz, Nature (1987) 329:219-222.

Anti-sense RNA and DNA molecules, ribozymes, RNAi, triple helix
polynucleotides, and novel transcription factors can be prepared by any method
known in the art for the synthesis of DNA and RNA molecules. These include
techniques for chemically synthesizing oligodeoxyribonucleotides well known in the
art including, but not limited to, solid phase phosphoramidite chemical synthesis.
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription
of DNA sequences encoding the antisense RNA molecule. Such DNA sequences may
be incorporated into a wide variety of vectors which incorporate suitable RNA
polymerase promoters such as the T7 or SP6 polymerase promoters. Alternatively,
antisense cDNA constructs that synthesize antisense RNA constitutively or inducibly,
depending on the promoter used, can be introduced stably or transiently into cells.

B. Gene Therapy

As described in detail in the preceding section, a variety of genetic
manipulations to achieve modulation of Prox-1 protein expression or activity are
contemplated. Additionally, where administration of proteins is contemplated, such
as zinc finger proteins targeted to Prox-1, administration of a gene therapy vector to
cause the protein of interest to be produced in vivo also is contemplated. Where
inhibition of proteins is contemplated (e.g., through use of antibodies or small
molecule inhibitors), inhibition of protein expression in vivo by genetic techniques,
such as knock-out techniques or anti-sense therapy, is contemplated.

It is now widely recognized that DNA may be introduced into a cell
using a variety of viral vectors. Exemplary vectors that have been described in the
literature include replication-deficient retroviral vectors, including but not limited to
lentivirus vectors (Kim et al., J. Virol., 72(1): 811-816 (1998); Kingsman & Johnson,
Scrip Magazine, October, 1998, pp. 43-46); adenoviral (see, for example, U.S. Patent
No. 5,824,544; U.S. Patent No. 5,707,618; U.S. Patent No. 5,792,453; U.S. Patent No.
5,693,509; U.S. Patent No. 5,670,488; U.S. Patent No. 5,585,362; Quantin et al., Proc.
Natl. Acad. Sci. USA, 89: 2581-2584 (1992); Stratford-Perricaudet et al., J. Clin.
Invest., 90: 626-630 (1992); and Rosenfeld et al., Cell, 68: 143-155 (1992)), retroviral
(see, for example, U.S. Patent No. 5,888,502; U.S. Patent No. 5,830,725; U.S. Patent
No. 5,770,414; U.S. Patent No. 5,686,278; U.S. Patent No. 4,861,719), adeno-associated
viral (see, for example, U.S. Patent No. 5,474,935; U.S. Patent No.
5,139,941; U.S. Patent No. 5,622,856; U.S. Patent No. 5,658,776; U.S. Patent No.
5,773,289; U.S. Patent No. 5,789,390; U.S. Patent No. 5,834,441; U.S. Patent No.
5,863,541; U.S. Patent No. 5,851,521; U.S. Patent No. 5,252,479; Gnatenko et al., J.
Investig. Med., 45: 87-98 (1997)), an adenoviral-adenoassociated viral hybrid (see, for
example, U.S. Patent No. 5,856,152) or a vaccinia viral or a herpesviral vector (see, for
example, U.S. Patent No. 5,879,934; U.S. Patent No. 5,849,571; U.S. Patent No.
5,830,727; U.S. Patent No. 5,661,033; U.S. Patent No. 5,328,688); Lipofectin-mediated
gene transfer (BRL); liposomal vectors (see, e.g., U.S. Patent No. 5,631,237
(Liposomes comprising Sendai virus proteins)); and combinations thereof. All of the
foregoing documents are incorporated herein by reference in their entirety.
Replication-deficient adenoviral vectors and adeno-associated viral vectors constitute
preferred embodiments.

In embodiments employing a viral vector, preferred polynucleotides
include a suitable promoter and polyadenylation sequence to promote expression in
the target tissue of interest. For many applications of the present invention, suitable
promoters/enhancers for mammalian cell expression include, e.g., cytomegalovirus
promoter/enhancer (Lehner et al., J. Clin. Microbiol., 29:2494-2502 (1991); Boshart
et al., Cell, 41:521-530 (1985)); Rous sarcoma virus promoter (Davis et al., Hum.
Gene Ther., 4: 151 (1993)); simian virus 40 promoter, long terminal repeat (LTR) of
retroviruses, keratin 14 promoter, and myosin heavy chain promoter.

In other embodiments, non-viral delivery is contemplated. These
include calcium phosphate precipitation (Graham and Van Der Eb, Virology, 52:456-467
(1973); Chen and Okayama, Mol. Cell Biol., 7:2745-2752 (1987); Rippe et al.,
Mol. Cell Biol., 10:689-695 (1990)), DEAE-dextran (Gopal, Mol. Cell Biol., 5:1188-1190
(1985)), electroporation (Tur-Kaspa et al., Mol. Cell Biol., 6:716-718 (1986);
Potter et al., Proc. Nat. Acad. Sci. USA, 81:7161-7165 (1984)), direct microinjection
(Harland and Weintraub, J. Cell Biol., 101:1094-1099 (1985)), DNA-loaded
liposomes (Nicolau and Sene, Biochim. Biophys. Acta, 721:185-190 (1982); Fraley et
al., Proc. Natl. Acad. Sci. USA, 76:3348-3352 (1979); Felgner, Sci. Am., 276(6):102-6
(1997); Felgner, Hum. Gene Ther., 7(15):1791-3 (1996)), cell sonication
(Fechheimer et al., Proc. Natl. Acad. Sci. USA, 84:8463-8467 (1987)), gene
bombardment using high velocity microprojectiles (Yang et al., Proc. Natl. Acad.
Sci. USA, 87:9568-9572 (1990)), and receptor-mediated transfection (Wu and Wu, J.
Biol. Chem., 262:4429-4432 (1987); Wu and Wu, Biochemistry, 27:887-892 (1988);
Wu and Wu, Adv. Drug Delivery Rev., 12:159-167 (1993)).

In a particular embodiment of the invention, the expression construct
(or indeed the peptides discussed above) may be entrapped in a liposome. Liposomes
are vesicular structures characterized by a phospholipid bilayer membrane and an
inner aqueous medium. Multilamellar liposomes have multiple lipid layers separated
by aqueous medium. They form spontaneously when phospholipids are suspended in
an excess of aqueous solution. The lipid components undergo self-rearrangement
before the formation of closed structures and entrap water and dissolved solutes
between the lipid bilayers (Ghosh and Bachhawat, "In Liver Diseases, Targeted
Diagnosis And Therapy Using Specific Receptors And Ligands," Wu, G, Wu, C, ed.,
New York: Marcel Dekker, pp. 87-104 (1991)). The addition of DNA to cationic
liposomes causes a topological transition from liposomes to optically birefringent
liquid-crystalline condensed globules (Radler et al., Science, 275(5301):810-4
(1997)). These DNA-lipid complexes are potential non-viral vectors for use in gene
therapy and delivery.

Liposome-mediated nucleic acid delivery and expression of foreign
DNA in vitro has been very successful. Also contemplated in the present invention
are various commercial approaches involving "lipofection" technology. In certain
embodiments of the invention, the liposome may be complexed with a
hemagglutinating virus (HVJ). This has been shown to facilitate fusion with the cell
membrane and promote cell entry of liposome-encapsulated DNA (Kaneda et al.,
Science, 243:375-378 (1989)). In other embodiments, the liposome may be
complexed or employed in conjunction with nuclear nonhistone chromosomal
proteins (HMG-1) (Kato et al., J. Biol. Chem., 266:3361-3364 (1991)). In yet further
embodiments, the liposome may be complexed or employed in conjunction with both
HVJ and HMG-1. Because such expression constructs have been successfully
employed in transfer and expression of nucleic acid in vitro and in vivo, they are
applicable for the present invention.

Other vector delivery systems that can be employed to deliver a
nucleic acid encoding a therapeutic gene into cells include receptor-mediated delivery
vehicles. These take advantage of the selective uptake of macromolecules by
receptor-mediated endocytosis in almost all eukaryotic cells. Because of the cell
type-specific distribution of various receptors, the delivery can be highly specific (Wu
and Wu (1993), supra).

Receptor-mediated gene targeting vehicles generally consist of two
components: a cell receptor-specific ligand and a DNA-binding agent. Several
ligands have been used for receptor-mediated gene transfer. The most extensively
characterized ligands are asialoorosomucoid (ASOR) (Wu and Wu (1987), supra) and
transferrin (Wagner et al., Proc. Natl. Acad. Sci. USA, 87(9):3410-3414 (1990)).
Recently, a synthetic neoglycoprotein, which recognizes the same receptor as ASOR,
has been used as a gene delivery vehicle (Ferkol et al., FASEB J., 7:1081-1091
(1993); Perales et al., Proc. Natl. Acad. Sci. USA, 91:4086-4090 (1994)) and
epidermal growth factor (EGF) has also been used to deliver genes to squamous
carcinoma cells (Myers, EPO 0273085).
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In other embodiments, the delivery vehicle may comprise a ligand and 
a liposome. For example, Nicolau, et al, Methods Enzymol, 149:157-176 (1987) 
employed lactosyl-ceramide, a galactose-terminal asialganglioside, incorporated into 
liposomes and observed an increase in the uptake of the insulin gene by hepatocytes. 
5 Thus, it is feasible that a nucleic acid encoding a therapeutic gene also may be 
specifically delivered into a particular cell type by any number of receptor-ligand 
systems with or without liposomes. 

In another embodiment of the invention, the expression construct may
simply consist of naked recombinant DNA or plasmids. Transfer of the construct may
be performed by any of the methods mentioned above that physically or chemically
permeabilize the cell membrane. This is applicable particularly for transfer in vitro;
however, it may be applied for in vivo use as well. Dubensky, et al., Proc. Nat. Acad.
Sci. USA, 81:7529-7533 (1984) successfully injected polyomavirus DNA in the form
of CaPO4 precipitates into liver and spleen of adult and newborn mice, demonstrating
active viral replication and acute infection. Benvenisty and Neshif, Proc. Nat. Acad.
Sci. USA, 83:9551-9555 (1986) also demonstrated that direct intraperitoneal injection
of CaPO4-precipitated plasmids results in expression of the transfected genes.

Another embodiment of the invention for transferring a naked DNA
expression construct into cells may involve particle bombardment. This method
depends on the ability to accelerate DNA-coated microprojectiles to a high velocity,
allowing them to pierce cell membranes and enter cells without killing them (Klein, et
al., Nature, 327:70-73 (1987)). Several devices for accelerating small particles have
been developed. One such device relies on a high voltage discharge to generate an
electrical current, which in turn provides the motive force (Yang, et al., Proc. Natl.
Acad. Sci. USA, 87:9568-9572 (1990)). The microprojectiles used have consisted of
biologically inert substances such as tungsten or gold beads.

Well-known techniques exist for gene delivery to in vivo and ex vivo
situations. For viral vectors, one generally will prepare a viral vector stock.
Depending on the type of virus and the titer attainable, one will deliver 1 x 10^4,
1 x 10^5, 1 x 10^6, 1 x 10^7, 1 x 10^8, 1 x 10^9, 1 x 10^10, 1 x 10^11 or 1 x 10^12
infectious particles to the patient. Similar figures may be extrapolated for liposomal
or other non-viral formulations by comparing relative uptake efficiencies.
Formulation as a pharmaceutically acceptable composition is discussed below.
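
The extrapolation from viral to non-viral doses mentioned above can be sketched
in a few lines of Python. This is only an illustration under stated assumptions:
the function name, the simple linear scaling rule and the example uptake
efficiencies are hypothetical and are not taken from the present disclosure.

```python
# Dose ladder named in the text: 1 x 10^4 through 1 x 10^12 infectious particles.
VIRAL_DOSES = [10 ** exponent for exponent in range(4, 13)]


def equivalent_dose(viral_dose, viral_uptake, other_uptake):
    """Scale a viral dose to another formulation (hypothetical linear rule).

    viral_uptake and other_uptake are the fractions of delivered particles
    taken up by target cells; both are illustrative assumptions.
    """
    if not (0 < viral_uptake <= 1 and 0 < other_uptake <= 1):
        raise ValueError("uptake efficiencies must be in (0, 1]")
    return viral_dose * viral_uptake / other_uptake


if __name__ == "__main__":
    for dose in VIRAL_DOSES:
        # Assumed example: liposomes taken up one tenth as efficiently as virus.
        scaled = equivalent_dose(dose, viral_uptake=0.5, other_uptake=0.05)
        print(f"{dose:.0e} viral particles ~ {scaled:.0e} liposomal particles")
```

In practice the relative uptake efficiencies would have to be measured for the
particular formulation and target tissue; the linear rule is only a starting point.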

Various routes are contemplated for various tumor types. For
practically any tumor, systemic delivery is contemplated. This will prove especially
important for attacking microscopic or metastatic cancer. Where discrete tumor mass
may be identified, a variety of direct, local and regional approaches may be taken.
For example, the tumor may be directly injected with the expression vector or protein.
A tumor bed may be treated prior to, during or after resection. Following resection,
one generally will deliver the vector by a catheter left in place following surgery. One
may utilize the tumor vasculature to introduce the vector into the tumor by injecting a
supporting vein or artery. A more distal blood supply route also may be utilized.

In an ex vivo embodiment, cells from the patient are removed and
maintained outside the body for at least some period of time. During this period, a
therapy is delivered, after which the cells are reintroduced into the patient; preferably,
any tumor cells in the sample have been killed.

C. Antibodies Immunoreactive with Prox-1 Protein 

In another aspect, the present invention contemplates an antibody that
is immunoreactive with a Prox-1 protein molecule of the present invention, or any
portion thereof. Such antibodies include, but are not limited to, polyclonal,
monoclonal, chimeric, single chain, Fab fragments and fragments produced by a Fab
expression library, bifunctional/bispecific antibodies, humanized antibodies, CDR
grafted antibodies, human antibodies and antibodies which include portions of CDR
sequences specific for Prox-1 protein. The antibodies are useful as diagnostic
reagents for measuring Prox-1 expression in a biological sample (e.g., a biopsy of
colon tissue), and are useful for binding to Prox-1 protein to inhibit Prox-1 activity
where the antibodies are delivered into cells.

Neutralizing antibodies, i.e., those which may suppress Prox-1
expression, are especially preferred for therapeutic embodiments. In a preferred
embodiment, an antibody is a monoclonal antibody. The invention provides for a
pharmaceutical composition comprising a therapeutically effective amount of an
antibody directed against Prox-1 protein. The antibody may bind to and neutralize the
apoptotic effects of the Prox-1 protein. The antibody may be formulated with a
pharmaceutically acceptable adjuvant. Means for preparing and characterizing
antibodies are well known in the art (see, e.g., Harlow and Lane, ANTIBODIES: A
LABORATORY MANUAL, Cold Spring Harbor Laboratory, Cold Spring Harbor,
N.Y., 1988).

Briefly, a polyclonal antibody is prepared by immunizing an animal
with an immunogen comprising a polypeptide of the present invention and collecting
antisera from that immunized animal. A wide range of animal species can be used for
the production of antisera. Typically an animal used for production of antisera is
a non-human animal including rabbits, mice, rats, hamsters, goats, sheep, pigs or
horses. Because of the relatively large blood volume of rabbits, a rabbit is a preferred
choice for production of polyclonal antibodies.

Depending on the host species, various adjuvants may be used to
increase immunological response. Such adjuvants include but are not limited to
Freund's, mineral gels such as aluminum hydroxide, and surface active substances
such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole
limpet hemocyanin, and dinitrophenol. BCG (bacilli Calmette-Guerin) and
Corynebacterium parvum are potentially useful human adjuvants.

Antibodies, both polyclonal and monoclonal, specific for isoforms of
antigen may be prepared using conventional immunization techniques, as will be
generally known to those of skill in the art. As used herein, the term "specific for" is
intended to mean that the variable regions of the antibodies recognize and bind Prox-1
protein and are capable of distinguishing Prox-1 protein from other antigens, for
example other secreted proapoptotic factors. A composition containing antigenic
epitopes of the compounds of the present invention can be used to immunize one or
more experimental animals, such as a rabbit or mouse, which will then proceed to
produce specific antibodies against the compounds of the present invention.
Polyclonal antisera may be obtained, after allowing time for antibody generation,
simply by bleeding the animal and preparing serum samples from the whole blood.

Monoclonal antibodies to Prox-1 protein may be prepared using any
technique which provides for the production of antibody molecules by continuous cell
lines in culture. These include but are not limited to the hybridoma technique
originally described by Koehler and Milstein (Nature 256: 495-497, 1975), the
human B-cell hybridoma technique (Kozbor et al., Immunol Today 4:72, 1983; Cote
et al., Proc Natl Acad Sci 80: 2026-2030, 1983) and the EBV-hybridoma technique
(Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss Inc., New York,
N.Y., pp. 77-96 (1985)).

When the hybridoma technique is employed, myeloma cell lines may
be used. Such cell lines suited for use in hybridoma-producing fusion procedures
preferably are non-antibody-producing, have high fusion efficiency, and have enzyme
deficiencies that render them incapable of growing in certain selective media which
support the growth of only the desired fused cells (hybridomas). For example, where
the immunized animal is a mouse, one may use P3-X63/Ag8, P3-X63-Ag8.653,
NS1/1.Ag 4 1, Sp210-Ag14, FO, NSO/U, MPC-11, MPC11-X45-GTG 1.7 and
S194/5XX0 Bul; for rats, one may use R210.RCY3, Y3-Ag 1.2.3, IR983F and 4B210;
and U-266, GM1500-GRG2, LICR-LON-HMy2 and UC729-6 are all useful in
connection with cell fusions. It should be noted that the hybridomas and cell lines
produced by such techniques for producing the monoclonal antibodies are
contemplated to be novel compositions of the present invention. An exemplary
method for producing monoclonal antibodies against Prox-1 is provided in Example
1. Those of skill in the art will appreciate that such a method may be modified using
techniques well known to those of skill in the art and still produce antibodies within
the scope of the present invention.

In addition to the production of monoclonal antibodies, techniques
developed for the production of "chimeric antibodies", i.e., the splicing of mouse
antibody genes to human antibody genes to obtain a molecule with appropriate antigen
specificity and biological activity, can be used (Morrison et al., Proc Natl Acad Sci 81:
6851-6855, 1984; Neuberger et al., Nature 312: 604-608, 1984; Takeda et al., Nature
314: 452-454, 1985). Alternatively, techniques described for the production of single
chain antibodies (U.S. Pat. No. 4,946,778) can be adapted to produce Prox-1
protein-specific single chain antibodies.

Antibodies may also be produced by inducing in vivo production in the
lymphocyte population or by screening recombinant immunoglobulin libraries or
panels of highly specific binding reagents as disclosed in Orlandi et al. (Proc Natl
Acad Sci 86: 3833-3837, 1989), and Winter G and Milstein C (Nature 349: 293-299,
1991).

Fully human antibodies relate to antibody molecules in which
essentially the entire sequences of both the light chain and the heavy chain, including
the CDRs, arise from human genes. Such antibodies are termed "human antibodies,"
or "fully human antibodies" herein. Human monoclonal antibodies can be prepared
by the trioma technique; the human B-cell hybridoma technique (see Kozbor, et al.,
Immunol Today 4: 72 (1983)) and the EBV hybridoma technique to produce human
monoclonal antibodies (see Cole, et al., 1985 In: Monoclonal Antibodies and
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human monoclonal antibodies may
be utilized in the practice of the present invention and may be produced by using
human hybridomas (see Cote, et al., Proc Natl Acad Sci USA 80: 2026-2030 (1983))
or by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al.,
1985 In: Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp.
77-96).

In addition, human antibodies can also be produced using additional
techniques, including phage display libraries (Hoogenboom and Winter, J. Mol. Biol.
227:381 (1991); Marks et al., J. Mol. Biol. 222:581 (1991)). Similarly, human
antibodies can be made by introducing human immunoglobulin loci into transgenic
animals, e.g., mice in which the endogenous immunoglobulin genes have been
partially or completely inactivated. Upon challenge, human antibody production is
observed, which closely resembles that seen in humans in all respects, including gene
rearrangement, assembly, and antibody repertoire. This approach is described, for
example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425;
5,661,016, and in Marks et al. (Bio/Technology 10:779-783 (1992)); Lonberg et al.
(Nature 368:856-859 (1994)); Morrison (Nature 368:812-13 (1994)); Fishwild et al.
(Nature Biotechnology 14:845-51 (1996)); Neuberger (Nature Biotechnology 14:826
(1996)); and Lonberg and Huszar (Intern. Rev. Immunol. 13:65-93 (1995)).

Human antibodies may additionally be produced using transgenic
nonhuman animals which are modified so as to produce fully human antibodies rather
than the animal's endogenous antibodies in response to challenge by an antigen. (See
PCT publication WO94/02602). The endogenous genes encoding the heavy and light
immunoglobulin chains in the nonhuman host have been incapacitated, and active loci
encoding human heavy and light chain immunoglobulins are inserted into the host's
genome. The human genes are incorporated, for example, using yeast artificial
chromosomes containing the requisite human DNA segments. An animal which
provides all the desired modifications is then obtained as progeny by crossbreeding
intermediate transgenic animals containing fewer than the full complement of the
modifications. The preferred embodiment of such a nonhuman animal is a mouse,
and is termed the Xenomouse™ as disclosed in PCT publications WO 96/33735 and
WO 96/34096. This animal produces B cells which secrete fully human
immunoglobulins. The antibodies can be obtained directly from the animal after
immunization with an immunogen of interest, as, for example, a preparation of a
polyclonal antibody, or alternatively from immortalized B cells derived from the
animal, such as hybridomas producing monoclonal antibodies. Additionally, the
genes encoding the immunoglobulins with human variable regions can be recovered
and expressed to obtain the antibodies directly, or can be further modified to obtain
analogs of antibodies such as, for example, single chain Fv molecules.

An example of a method of producing a nonhuman host, exemplified
as a mouse, lacking expression of an endogenous immunoglobulin heavy chain is
disclosed in U.S. Patent No. 5,939,598. It can be obtained by a method including
deleting the J segment genes from at least one endogenous heavy chain locus in an
embryonic stem cell to prevent rearrangement of the locus and to prevent formation of
a transcript of a rearranged immunoglobulin heavy chain locus, the deletion being
effected by a targeting vector containing a gene encoding a selectable marker; and
producing from the embryonic stem cell a transgenic mouse whose somatic and germ
cells contain the gene encoding the selectable marker.

Antibodies as described herein are useful in standard immunochemical
procedures, such as ELISA, radioimmunoassays, and Western blot methods, and in
immunohistochemical procedures such as tissue staining, as well as in other
procedures which may utilize antibodies specific to Prox-1 protein-related antigen
epitopes. Additionally, it is proposed that monoclonal antibodies specific to the
particular Prox-1 protein of different species may be utilized in other useful
applications.

WO 2005/014854

PCT/EP2004/008819

In general, both polyclonal and monoclonal antibodies against Prox-1
protein may be used in a variety of embodiments. In certain aspects, the antibodies
may be employed for therapeutic purposes in which the inhibition of Prox-1 protein
activity is desired (e.g., to reduce apoptosis in neuronal cells). Antibodies may be
used to block Prox-1 protein action.

Antibodies of the present invention also may prove useful for diagnostic
purposes in order, for example, to detect increases or decreases in Prox-1 protein in
tissue samples including samples from sites of inflammation, or fluid samples including
blood serum, plasma and exudate samples. Additional aspects will employ the
antibodies of the present invention in antibody cloning protocols to obtain cDNAs or
genes encoding other Prox-1 proteins. They may also be used in inhibition studies to
analyze the effects of Prox-1 related peptides in cells or animals. Anti-Prox-1 protein
antibodies will also be useful in immunolocalization studies to analyze the
distribution of Prox-1 protein during various cellular events, for example, to
determine the cellular or tissue-specific distribution of Prox-1 protein polypeptides
at different points in the cell cycle. A particularly useful application of such
antibodies is in purifying native or recombinant Prox-1 protein, for example, using an
antibody affinity column. The operation of all such immunological techniques will be
known to those of skill in the art in light of the present disclosure.

D. Assaying for Other Modulators of Prox-1 Activity and/or Expression 

In some situations, it may be desirable to identify molecules that are
modulators, i.e., agonists or antagonists, of the activity of Prox-1 protein. Natural or
synthetic molecules that modulate Prox-1 protein may be identified using one or more
screening assays, such as those described herein. Such molecules may be
administered either in an ex vivo manner, or in an in vivo manner by injection, or by
oral delivery, implantation device or the like.

"Test molecule(s)" refers to the molecule(s) that is/are under evaluation
for the ability to modulate (i.e., increase or decrease) the activity of Prox-1 protein.
Most commonly, a molecule that modulates Prox-1 activity will interact directly with
Prox-1. However, it is also contemplated that a molecule may modulate Prox-1
protein activity indirectly, such as by affecting Prox-1 gene expression, or by binding
to a Prox-1 binding partner. In one embodiment, a test molecule will bind to a Prox-1
protein with an affinity constant of at least about 10^-6 M, preferably about 10^-8 M,
more preferably about 10^-9 M, and even more preferably about 10^-10 M.
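
To make the affinity thresholds above more concrete, the following Python sketch
computes the fraction of Prox-1 protein occupied by a test molecule at
equilibrium. It assumes simple 1:1 binding, treats the stated constants as
dissociation constants (they are given in molar units), and assumes the test
molecule is in excess; the function name and the example concentration are
hypothetical.

```python
def fraction_bound(ligand_molar, kd_molar):
    """Equilibrium occupancy for 1:1 binding: [L] / ([L] + Kd)."""
    if ligand_molar < 0 or kd_molar <= 0:
        raise ValueError("ligand must be >= 0 and Kd must be > 0")
    return ligand_molar / (ligand_molar + kd_molar)


if __name__ == "__main__":
    test_conc = 1e-8  # assumed 10 nM test molecule, for illustration only
    for kd in (1e-6, 1e-9, 1e-10):
        occupancy = fraction_bound(test_conc, kd)
        print(f"Kd = {kd:.0e} M: {occupancy:.1%} of Prox-1 bound")
```

At a fixed concentration, a smaller constant gives higher occupancy, which is why
the tighter-binding molecules are the more preferred ones.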

Methods for identifying compounds which interact with Prox-1 protein
are encompassed by the present invention. In certain embodiments, a Prox-1 protein
is incubated with a test molecule under conditions which permit the interaction of the
test molecule with a Prox-1 protein, and the extent of the interaction can be measured.
The test molecule(s) can be screened in a substantially purified form or in a crude
mixture.

In certain embodiments, a Prox-1 protein agonist or antagonist may be
a protein, peptide, carbohydrate, lipid or small molecular weight molecule which
interacts with Prox-1 to regulate its activity. Molecules which regulate Prox-1
expression include nucleic acids which are complementary to nucleic acids encoding
a Prox-1 protein, or are complementary to nucleic acid sequences which direct or
control the expression of Prox-1 protein, and which act as anti-sense regulators of
expression.
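
The notion of a complementary (anti-sense) nucleic acid can be illustrated with a
short Python sketch that returns the reverse complement of a DNA target region.
The function name and the example sequence are hypothetical placeholders; the
sequence shown is not a Prox-1 sequence.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def antisense_dna(target):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    if set(target.upper()) - set("ACGT"):
        raise ValueError("sequence must contain only A, C, G and T")
    return target.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    placeholder_target = "ATGCCT"  # hypothetical six-base target region
    print(antisense_dna(placeholder_target))
```

A practical anti-sense design would also weigh length, secondary structure and
off-target matches, none of which this sketch attempts.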

Once a set of test molecules has been identified as interacting with
Prox-1 protein, the molecules may be further evaluated for their ability to increase or
decrease Prox-1 activity. The measurement of the interaction of test molecules with
Prox-1 may be carried out in several formats, including solution-phase assays and
immunoassays. In general, test molecules are incubated with Prox-1 for a specified
period of time, and Prox-1 protein activity is determined by one or more assays for
measuring biological activity.

In the event that Prox-1 displays biological activity through an
interaction with a binding partner, a variety of in vitro assays may be used to measure
the binding of Prox-1 to the corresponding binding partner. These assays may be
used to screen test molecules for their ability to increase or decrease the rate and/or
the extent of binding of Prox-1 to its binding partner. In one assay, a Prox-1
polypeptide is immobilized in the wells of a microtiter plate. Radiolabeled Prox-1
binding partner and the test molecule(s) can then be added either one at a time (in
either order) or simultaneously to the wells. After incubation, the wells can be
washed and counted (using a scintillation counter) for radioactivity to determine the
extent to which the binding partner bound to Prox-1 polypeptide. Typically, the
molecules will be tested over a range of concentrations, and a series of control wells
lacking one or more elements of the test assays can be used for accuracy in the
evaluation of the results. An alternative to this method involves reversing the
"positions" of the proteins, i.e., immobilizing Prox-1 binding partner to the microtiter
plate wells, incubating with the test molecule and radiolabeled Prox-1 polypeptide,
and determining the extent of Prox-1 polypeptide binding. See, for example, chapter
18, Current Protocols in Molecular Biology, Ausubel et al., eds., John Wiley & Sons,
New York, NY (1995).
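
One way the scintillation counts from such a plate might be reduced is sketched
below in Python. The sketch assumes two kinds of control wells (no test molecule,
and no immobilized protein as background) and estimates a half-maximal inhibitory
concentration by log-linear interpolation; the function names, the choice of
controls and the interpolation rule are illustrative assumptions rather than part
of the disclosed assay.

```python
import math


def percent_inhibition(cpm, cpm_total, cpm_background):
    """Percent loss of specific binding relative to the no-compound control."""
    specific_max = cpm_total - cpm_background
    if specific_max <= 0:
        raise ValueError("total binding must exceed background")
    return 100.0 * (1.0 - (cpm - cpm_background) / specific_max)


def ic50_by_interpolation(concs, inhibitions):
    """Interpolate, on a log10 concentration axis, where inhibition is 50%.

    concs must be positive and ascending. Returns None when the measured
    inhibitions never cross 50%.
    """
    points = list(zip(concs, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi and i_hi != i_lo:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_lo, log_hi = math.log10(c_lo), math.log10(c_hi)
            return 10 ** (log_lo + frac * (log_hi - log_lo))
    return None


if __name__ == "__main__":
    # Hypothetical counts per minute at three test-molecule concentrations (M).
    concs = [1e-9, 1e-8, 1e-7]
    counts = [9100, 5500, 1900]
    inhib = [percent_inhibition(c, cpm_total=10000, cpm_background=1000)
             for c in counts]
    print(inhib, ic50_by_interpolation(concs, inhib))
```

A full analysis would normally fit a sigmoidal dose-response curve to replicate
wells; interpolation is used here only to keep the example short.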

As an alternative to radiolabeling, Prox-1 protein or its binding partner
may be conjugated to biotin and the presence of biotinylated protein can then be
detected using streptavidin linked to an enzyme, such as horseradish peroxidase
(HRP) or alkaline phosphatase (AP), that can be detected colorimetrically or by
fluorescent tagging of streptavidin. An antibody directed to Prox-1 or to a Prox-1
binding partner and conjugated to biotin may also be used and can be detected after
incubation with streptavidin linked to AP or HRP.

A Prox-1 protein or Prox-1 binding partner can also be immobilized by
attachment to agarose beads, acrylic beads or other types of such inert solid phase
substrates. The substrate-protein complex can be placed in a solution containing the
complementary protein and the test compound. After incubation the beads can be
precipitated by centrifugation, and the amount of binding between Prox-1 protein and
its binding partner can be assessed using the methods described herein. Alternatively,
the substrate-protein complex can be immobilized in a column, and the test molecule
and complementary protein are passed through the column. The formation of a
complex between a Prox-1 protein and its binding partner can then be assessed using
any of the techniques set forth herein, e.g., radiolabeling, antibody binding or the like.

Another in vitro assay that is useful for identifying a test molecule
which increases or decreases the formation of a complex between Prox-1 and a Prox-1
binding partner is a surface plasmon resonance detector system such as the BIAcore
assay system (Pharmacia, Piscataway, NJ). The BIAcore assay may be carried out
using the manufacturer's protocol. This assay essentially involves the covalent
binding of either Prox-1 or a Prox-1 binding partner to a dextran-coated sensor chip
which is located in a detector. The test compound and the other complementary
protein can then be injected, either simultaneously or sequentially, into the chamber
containing the sensor chip. The amount of complementary protein that binds can be
assessed based on the change in molecular mass which is physically associated with
the dextran-coated side of the sensor chip; the change in molecular mass can be
measured by the detector system.
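
The detector signal in a surface plasmon resonance experiment is often
interpreted with a simple 1:1 interaction model. The Python sketch below, offered
only as an illustration, computes the expected response during the injection
phase under that model. The rate constants, the maximal response and the function
names are assumptions chosen for the example; they do not describe the
manufacturer's software or any measured Prox-1 interaction.

```python
import math


def equilibrium_response(conc, k_on, k_off, r_max):
    """Plateau response for 1:1 binding: Rmax * C / (C + KD), KD = k_off / k_on."""
    kd_equilibrium = k_off / k_on
    return r_max * conc / (conc + kd_equilibrium)


def association_response(t, conc, k_on, k_off, r_max):
    """Response at time t (s) after injection starts, beginning from zero."""
    k_observed = k_on * conc + k_off
    plateau = equilibrium_response(conc, k_on, k_off, r_max)
    return plateau * (1.0 - math.exp(-k_observed * t))


if __name__ == "__main__":
    # Assumed values: k_on in 1/(M*s), k_off in 1/s, analyte at 10 nM.
    params = dict(conc=1e-8, k_on=1e5, k_off=1e-3, r_max=100.0)
    for seconds in (0, 60, 300, 1200):
        print(seconds, round(association_response(seconds, **params), 2))
```

A test compound that disrupts complex formation would be expected to lower the
plateau or slow the rise of this curve relative to a compound-free injection.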

In some cases, it may be desirable to evaluate two or more test
compounds together for their ability to increase or decrease the formation of a
complex between Prox-1 polypeptide and a Prox-1 binding partner. In these cases,
the assays set forth herein can be readily modified by adding such additional test
compound(s) either simultaneously with, or subsequent to, the first test compound. The
remainder of the steps in the assay are as set forth herein.

In vitro assays such as those described herein may be used
advantageously to screen large numbers of compounds for effects on complex
formation by Prox-1 polypeptide and a Prox-1 binding partner. The assays may be
automated to screen compounds generated in phage display, synthetic peptide, and
chemical synthesis libraries.

Compounds which increase or decrease the formation of a complex
between a Prox-1 polypeptide and a Prox-1 binding partner may also be screened in
cell culture using cells and cell lines expressing either Prox-1 polypeptide or a Prox-1
binding partner. Cells and cell lines may be obtained from any mammal. The binding
of a Prox-1 protein to cells expressing a Prox-1 binding partner at the surface is
evaluated in the presence or absence of test molecules, and the extent of binding may
be determined by, for example, flow cytometry using a biotinylated antibody to a
Prox-1 binding partner. Cell culture assays can be used advantageously to further
evaluate compounds that score positive in protein binding assays described herein.

Cell cultures can also be used to screen the impact of a drug candidate.
For example, drug candidates may decrease or increase the expression of the Prox-1
like gene. In certain embodiments, the amount of Prox-1 protein that is produced may
be measured after exposure of the cell culture to the drug candidate. In certain
embodiments, one may detect the actual impact of the drug candidate on the cell
culture. For example, the overexpression of a particular gene may have a particular
impact on the cell culture. In such cases, one may test a drug candidate's ability to
increase or decrease the expression of the gene or its ability to prevent or inhibit a
particular impact on the cell culture. In other examples, the production of a particular
metabolic product such as a fragment of a polypeptide may result in, or be associated
with, a disease or pathological condition. In such cases, one may test a drug
candidate's ability to decrease the production of such a metabolic product in a cell
culture.

E. Internalizing Proteins 

The tat protein sequence (from HIV) can be used to internalize
proteins into a cell. See, e.g., Fawell et al., Proc. Natl. Acad. Sci. USA, 91:664-668
(1994). For example, an 11 amino acid sequence (YGRKKRRQRRR; SEQ ID NO:
46) of the HIV tat protein (termed the "protein transduction domain", or TAT PTD)
has been described as mediating delivery across the cytoplasmic membrane and the
nuclear membrane of a cell. See Schwarze et al., Science, 285:1569-1572 (1999); and
Nagahara et al., Nature Medicine, 4:1449-1452 (1998). In these procedures,
FITC-constructs are prepared which bind to cells as observed by fluorescence-activated
cell sorting (FACS) analysis, and these constructs penetrate tissues after i.p.
administration. Next, tat-β-gal fusion proteins are constructed. Cells treated with this
construct demonstrate β-gal activity. Following injection, a number of tissues,
including liver, kidney, lung, heart and brain tissue, have been found to demonstrate
expression using these procedures. It is believed that these constructions underwent
some degree of unfolding in order to enter the cell; as such, refolding may be required
after entering the cell.

It will thus be appreciated that the tat protein sequence may be used to
internalize a desired protein or polypeptide into a cell. For example, using the tat
protein sequence, a Prox-1 antagonist (such as an anti-Prox-1 binding agent, small
molecule, or antisense oligonucleotide) can be administered intracellularly to inhibit
the activity of a Prox-1 molecule. See also Strauss, E., Science, 285:1466-1467
(1999).

F. Rational Drug Design 

The goal of rational drug design is to produce structural analogs of
biologically active polypeptides or compounds with which they interact (agonists,
antagonists, inhibitors, peptidomimetics, binding partners, etc.). By creating such
analogs, it is possible to fashion drugs which are more active or stable than the natural
molecules, which have different susceptibility to alteration or which may affect the
function of various other molecules. In one approach, one generates a
three-dimensional structure for Prox-1 protein or a fragment thereof. This is accomplished
by x-ray crystallography, computer modeling or by a combination of both approaches.
An alternative approach, "alanine scan," involves the random replacement of residues
throughout the molecule with alanine, and the resulting effect on function is determined.
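
As an illustration of how the variants for an alanine scan might be enumerated,
the Python sketch below generates every single-residue alanine substitution of a
peptide. Note the assumption: the text above speaks of random replacement,
whereas this sketch substitutes each position systematically, one at a time. The
function name is hypothetical, and the 11-residue tat sequence recited elsewhere
in this specification is used only as a convenient short input.

```python
def alanine_scan(sequence):
    """Yield (position, original_residue, mutant) for each non-alanine site.

    Positions are 1-based. Residues that are already alanine are skipped,
    since substituting them would reproduce the parent sequence.
    """
    for index, residue in enumerate(sequence):
        if residue == "A":
            continue
        mutant = sequence[:index] + "A" + sequence[index + 1:]
        yield index + 1, residue, mutant


if __name__ == "__main__":
    for position, original, mutant in alanine_scan("YGRKKRRQRRR"):
        print(f"{original}{position}A  {mutant}")
```

Each variant would then be expressed and assayed, and positions whose
substitution changes function would be flagged as candidates for further study.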

It also is possible to isolate a specific antibody, selected by a functional
assay, and then solve its crystal structure. In principle, this approach yields a
pharmacore upon which subsequent drug design can be based. It is possible to bypass
protein crystallography altogether by generating anti-idiotypic antibodies to a
functional, pharmacologically active antibody. As a mirror image of a mirror image,
the binding site of the anti-idiotype would be expected to be an analog of the original
antigen. The anti-idiotype could then be used to identify and isolate peptides from
banks of chemically- or biologically-produced peptides. Selected peptides would then
serve as the pharmacore. Anti-idiotypes may be generated using the methods
described herein for producing antibodies, using an antibody as the antigen.

Thus, one may design drugs which have activity as stimulators,
inhibitors, agonists, or antagonists of Prox-1 protein or molecules affected by Prox-1
protein function. Such rational drug design may start with lead compounds identified
by the present invention. By virtue of the availability of cloned Prox-1 protein
sequences, sufficient amounts of the related proteins can be produced to perform
crystallographic studies. In addition, knowledge of the polypeptide sequences permits
computer-employed predictions of structure-function relationships.

G. Therapeutic Methods

As discussed herein, polynucleotides or modulators of Prox-1
(including inhibitors of Prox-1) are administered to treat cancer. Therapeutic
compositions can be administered in therapeutically effective dosages alone or in
combination with adjuvant cancer therapy such as surgery, chemotherapy,
radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial effect,
e.g., reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or
otherwise improving overall clinical condition, without necessarily eradicating the
cancer.

The composition can also be administered in therapeutically effective
amounts as a portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture
of the polypeptide or modulator of the invention with one or more anti-cancer drugs in
addition to a pharmaceutically acceptable carrier for delivery. Any anti-cancer drugs
can be used as a treatment in combination with the polypeptide or modulator of the
invention, including: Actinomycin D, Aminoglutethimide, Asparaginase, Bleomycin,
Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis-DDP),
Cyclophosphamide, Cytarabine HCl (Cytosine arabinoside), Dacarbazine,
Dactinomycin, Daunorubicin HCl, Doxorubicin HCl, Estramustine phosphate sodium,
Etoposide (VP-16-213), Floxuridine, 5-Fluorouracil (5-FU), Flutamide, Hydroxyurea
(hydroxycarbamide), Ifosfamide, Interferon Alpha-2a, Interferon Alpha-2b,
Leuprolide acetate (LHRH-releasing factor analog), Lomustine, Mechlorethamine
HCl (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, Methotrexate (MTX),
Mitomycin, Mitoxantrone HCl, Octreotide, Plicamycin, Procarbazine HCl,
Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate,
Vincristine sulfate, Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2,
Mitoguazone, Pentostatin, Semustine, Teniposide, and Vindesine sulfate.

In addition, therapeutic compositions of the invention may be used for
prophylactic treatment of cancer. There are hereditary conditions and/or
environmental situations (e.g. exposure to carcinogens) known in the art that
predispose an individual to developing cancers. Under these circumstances, it may be
beneficial to treat these individuals with therapeutically effective doses of
compositions of the invention to reduce the risk of developing cancers.

In vitro and in vivo models can be used to determine the effective
doses of the compositions of the invention for cancer treatment. These
models include proliferation and differentiation assays of cultured tumor cells, growth
of cultured tumor cells in soft agar (see Freshney, (1987) Culture of Animal Cells: A
Manual of Basic Technique, Wiley-Liss, New York, NY Ch 18 and Ch 21), tumor
systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 52: 921-30
(1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as
described in Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis
assays such as induction of vascularization of the chick chorioallantoic membrane or
induction of vascular endothelial cell migration as described in Ribatti et al., Intl. J.
Dev. Biol., 40: 1189-97 (1999) and Li et al., Clin. Exp. Metastasis, 17:423-9 (1999)
respectively. Suitable tumor cell lines are available, e.g. from American Type
Culture Collection catalogs, and/or are described below.

H. Pharmaceutical Compositions 

Purified nucleic acids, antisense molecules, purified protein,
antibodies, antagonists, or inhibitors may all be used as pharmaceutical compositions.
Delivery of specific molecules for therapeutic purposes in this invention is further
described below.

The active compositions of the present invention include classic
pharmaceutical preparations. Administration of these compositions according to the
present invention will be via any common route so long as the target tissue is
available via that route. The pharmaceutical compositions may be introduced into the
subject by any conventional method, e.g., by intravenous, intradermal, intramuscular,
intramammary, intraperitoneal, intrathecal, intraocular, retrobulbar, intrapulmonary
(e.g., term release); by oral, sublingual, nasal, anal, vaginal, or transdermal delivery,
or by surgical implantation at a particular site, e.g., embedded under the splenic
capsule, brain, or in the cornea. The treatment may consist of a single dose or a
plurality of doses over a period of time.

The active compounds may be prepared for administration as solutions
of free base or pharmacologically acceptable salts in water suitably mixed with a
surfactant, such as hydroxypropylcellulose. Dispersions also can be prepared in
glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under
ordinary conditions of storage and use, these preparations contain a preservative to
prevent the growth of microorganisms.

The pharmaceutical forms suitable for injectable use include sterile
aqueous solutions or dispersions and sterile powders for the extemporaneous
preparation of sterile injectable solutions or dispersions. In all cases the form must be
sterile and must be fluid to the extent that easy syringability exists. It must be stable
under the conditions of manufacture and storage and must be preserved against the
contaminating action of microorganisms, such as bacteria and fungi. The carrier can
be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for
example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like),
suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained,
for example, by the use of a coating, such as lecithin, by the maintenance of the
required particle size in the case of dispersion and by the use of surfactants. The
prevention of the action of microorganisms can be brought about by various
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol,
sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include
isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the
injectable compositions can be brought about by the use in the compositions of agents
delaying absorption, for example, aluminum monostearate and gelatin.

Sterile injectable solutions are prepared by incorporating the active
compounds in the required amount in the appropriate solvent with various of the other
ingredients enumerated above, as required, followed by filtered sterilization.
Generally, dispersions are prepared by incorporating the various sterilized active
ingredients into a sterile vehicle which contains the basic dispersion medium and the
required other ingredients from those enumerated above. In the case of sterile
powders for the preparation of sterile injectable solutions, the preferred methods of
preparation are vacuum-drying and freeze-drying techniques which yield a powder of
the active ingredient plus any additional desired ingredient from a previously
sterile-filtered solution thereof.

As used herein, "pharmaceutically acceptable carrier" includes any and
all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic
and absorption delaying agents and the like. The use of such media and agents for
pharmaceutical active substances is well known in the art. Except insofar as any
conventional media or agent is incompatible with the active ingredient, its use in the
therapeutic compositions is contemplated. Supplementary active ingredients also can
be incorporated into the compositions.

For oral administration the active compositions may be incorporated
with excipients and used in the form of non-ingestible mouthwashes and dentifrices.
A mouthwash may be prepared incorporating the active ingredient in the required
amount in an appropriate solvent, such as a sodium borate solution (Dobell's
Solution). Alternatively, the active ingredient may be incorporated into an antiseptic
wash containing sodium borate, glycerin and potassium bicarbonate. The active
ingredient may also be dispersed in dentifrices, including: gels, pastes, powders and
slurries. The active ingredient may be added in a therapeutically effective amount to
a paste dentifrice that may include water, binders, abrasives, flavoring agents,
foaming agents, and humectants.

The compositions of the present invention may be formulated in a 
neutral or salt form. Pharmaceutically-acceptable salts include the acid addition salts 
(formed with the free amino groups of the protein) and which are formed with 
inorganic acids such as, for example, hydrochloric or phosphoric acids, or such 
organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the
free carboxyl groups also can be derived from inorganic bases such as, for example, 
sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases 
as isopropylamine, trimethylamine, histidine, procaine and the like. 

Upon formulation, solutions will be administered in a manner
compatible with the dosage formulation and in such amount as is therapeutically
effective. The formulations are easily administered in a variety of dosage forms such
as injectable solutions, drug release capsules and the like. For parenteral
administration in an aqueous solution, for example, the solution should be suitably
buffered if necessary and the liquid diluent first rendered isotonic with sufficient
saline or glucose. These particular aqueous solutions are especially suitable for
intravenous, intramuscular, subcutaneous and intraperitoneal administration.

In the clinical setting an "effective amount" is an amount sufficient to
effect beneficial or desired clinical results. An effective amount can be administered
in one or more doses. In terms of treatment, an "effective amount" of polynucleotide
and/or polypeptide is an amount that results in amelioration of symptoms or a
prolongation of survival in a patient. The effective amount is generally determined by
the physician on a case-by-case basis and is within the skill of one in the art. Several
factors are typically taken into account when determining an appropriate dosage.
These factors include age, sex and weight of the patient, the condition being treated,
the severity of the condition and the form of the antibody being administered. For
instance, in embodiments in which the antibody compositions of the present invention
are being therapeutically administered, it is likely the concentration of a single chain
antibody need not be as high as that of native antibodies in order to be therapeutically
effective. Determination of the effective amounts is well within the capability of
those skilled in the art, especially in light of the detailed disclosure provided herein.
For any compound used in the method of the invention, the therapeutically effective
dose can be estimated initially from cell culture assays. For example, a dose can be
formulated in animal models to achieve a circulating concentration range that includes
the IC50 as determined in cell culture (i.e., the concentration of the test compound
which achieves a half-maximal inhibition of the C-proteinase activity). Such
information can be used to more accurately determine useful doses in humans.
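
As an illustration of estimating an IC50 from cell culture data, the following is a minimal sketch that interpolates on a log-concentration scale between the two measurements that bracket 50% inhibition. The function name, the interpolation approach, and the example numbers are assumptions made for illustration; they are not taken from the disclosure, and a fitted dose-response model could be used instead.

```python
import math


def estimate_ic50(concentrations, inhibition_fractions):
    """Estimate the IC50 by log-linear interpolation.

    concentrations: ascending, positive test-compound concentrations.
    inhibition_fractions: measured inhibition at each concentration (0.0 to 1.0).
    Both inputs are hypothetical assay readouts, not data from the source text.
    """
    points = list(zip(concentrations, inhibition_fractions))
    # Walk adjacent measurement pairs and find the pair that brackets 50%.
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 0.5 <= i_hi and i_hi > i_lo:
            frac = (0.5 - i_lo) / (i_hi - i_lo)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    raise ValueError("50% inhibition is not bracketed by the data")


# Hypothetical example: inhibition rises from 25% to 75% between 1 and 100 units.
ic50 = estimate_ic50([1.0, 100.0], [0.25, 0.75])
```

A circulating concentration range chosen for an animal model would then be checked to include the returned value.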

Toxicity and therapeutic efficacy of such compounds can be
determined by standard pharmaceutical procedures in cell cultures or experimental
animals, e.g., for determining the LD50 (the dose lethal to 50% of the population) and
the ED50 (the dose therapeutically effective in 50% of the population). The dose ratio
between toxic and therapeutic effects is the therapeutic index and it can be expressed
as the ratio between LD50 and ED50. Compounds which exhibit high therapeutic
indices are preferred. The data obtained from these cell culture assays and animal
studies can be used in formulating a range of dosage for use in humans. The dosage of
such compounds lies preferably within a range of circulating concentrations that
include the ED50 with little or no toxicity. The dosage may vary within this range
depending upon the dosage form employed and the route of administration utilized.
The exact formulation, route of administration and dosage can be chosen by the
individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in
"The Pharmacological Basis of Therapeutics," Ch. 1, p. 1. Dosage amount and interval
may be adjusted individually to provide plasma levels of the active moiety which are
sufficient to maintain the C-proteinase inhibiting effects, or minimal effective
concentration (MEC). The MEC will vary for each compound but can be estimated
from in vitro data; for example, the concentration necessary to achieve 50-90%
inhibition of the C-proteinase using the assays described herein. Dosages necessary
to achieve the MEC will depend on individual characteristics and route of
administration. However, HPLC assays or bioassays can be used to determine plasma
concentrations.
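
The therapeutic index described above, expressed as the ratio between LD50 and ED50, can be sketched as a short calculation. The function name and the example values are hypothetical and serve only to show the arithmetic.

```python
def therapeutic_index(ld50, ed50):
    """Return the therapeutic index as the ratio LD50 / ED50.

    ld50: dose lethal to 50% of the population.
    ed50: dose therapeutically effective in 50% of the population.
    Both must be positive and expressed in the same units.
    """
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical compounds: the one with the higher index would be preferred.
index_a = therapeutic_index(ld50=200.0, ed50=10.0)
index_b = therapeutic_index(ld50=200.0, ed50=50.0)
```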

Dosage intervals can also be determined using the MEC value.
Compounds should be administered using a regimen which maintains plasma levels
above the MEC for 10-90% of the time, preferably between 30-90% and most
preferably between 50-90%. In cases of local administration or selective uptake, the
effective local concentration of the drug may not be related to plasma concentration.
Refinement of the calculations necessary to determine the appropriate treatment dose
is routinely made by those of ordinary skill in the art without undue experimentation,
especially in light of the dosage information and assays disclosed herein as well as the
pharmacokinetic data observed in animals or human clinical trials. As studies are
conducted, further information will emerge regarding appropriate dosage levels and
duration of treatment for specific diseases and conditions.
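
To illustrate how the fraction of time above the MEC relates to the dosing interval, the following is a minimal sketch. It assumes a one-compartment model with first-order elimination after each dose, which is a simplification introduced here and not a model stated in the disclosure; the function name and all parameter values are hypothetical.

```python
import math


def fraction_time_above_mec(peak_conc, mec, half_life_h, interval_h):
    """Fraction of a dosing interval during which plasma level exceeds the MEC.

    Assumes the concentration starts at peak_conc and decays exponentially
    with the given half-life until the next dose (an illustrative assumption).
    """
    if min(peak_conc, mec, half_life_h, interval_h) <= 0:
        raise ValueError("all parameters must be positive")
    if peak_conc <= mec:
        return 0.0
    k = math.log(2) / half_life_h              # first-order elimination rate
    t_above = math.log(peak_conc / mec) / k    # hours until level falls to MEC
    return min(t_above, interval_h) / interval_h


# Hypothetical regimen: peak 8, MEC 2, half-life 4 h, dosing every 12 h.
fraction = fraction_time_above_mec(8.0, 2.0, 4.0, 12.0)
```

Under these assumed numbers the level stays above the MEC for two half-lives (8 of 12 hours), which falls within the 50-90% range noted above; shortening the interval raises the fraction.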

In a preferred embodiment, the present invention is directed at
treatment of colon cancer, including colon cancer indicated by the presence of
overexpression of Prox-1. A variety of different routes of administration are
contemplated. For example, in the case of a tumor, the discrete tumor mass may be
injected. The injections may be single or multiple; where multiple, injections are
made at about 1 cm spacings across the accessible surface of the tumor.
Alternatively, targeting the tumor vasculature by direct, local or regional intra-arterial
injection is contemplated. The lymphatic systems, including regional lymph nodes,
present another likely target for delivery. Further, systemic injection may be
preferred.

It will be appreciated that the pharmaceutical compositions and
treatment methods of the invention may be useful in fields of human medicine and
veterinary medicine. Thus the subject to be treated may be a mammal, preferably
human or other animal. For veterinary purposes, subjects include for example, farm
animals including cows, sheep, pigs, horses and goats, companion animals such as
dogs and cats, exotic and/or zoo animals, laboratory animals including mice, rats,
rabbits, guinea pigs and hamsters; and poultry such as chickens, turkeys, ducks and
geese.

The compositions may, if desired, be presented in a pack or dispenser 
device which may contain one or more unit dosage forms containing the active 
ingredient. The pack may, for example, comprise metal or plastic foil, such as a 
blister pack. The pack or dispenser device may be accompanied by instructions for 
administration. Compositions comprising a compound of the invention formulated in
a compatible pharmaceutical carrier may also be prepared, placed in an appropriate 
container, and labeled for treatment of an indicated condition. 

H. Transgenic Animals 

A transgenic animal can be prepared in a number of ways. A
transgenic organism is one that has an extra or exogenous fragment of DNA
incorporated into its genome, sometimes replacing an endogenous piece of DNA. In
order to achieve stable inheritance of the extra or exogenous DNA, the integration
event must occur in a cell type that can give rise to functional germ cells. The two
animal cell types that are used for generating transgenic animals are fertilized egg
cells and embryonic stem cells. Embryonic stem (ES) cells can be returned from in
vitro culture to a "host" embryo where they become incorporated into the developing
animal and can give rise to transgenic cells in all tissues, including germ cells. The ES
cells are transfected in culture and then the mutation is transmitted into the germline
by injecting the cells into an embryo. The animals carrying mutated germ cells are
then bred to produce transgenic offspring. The use of ES cells to make genetic
changes in the mouse germline is well recognized. For reviews of this technology,
those of skill in the art are referred to Bronson & Smithies, J. Biol. Chem., 269(44),
27155-27158, 1994; Torres, Curr. Top. Dev. Biol., 36, 99-114; 1998 and the
references contained therein.

Generally, blastocysts are isolated from pregnant mice at a given stage
in development, for example, the blastocyst from mice may be isolated at day 4 of
development (where day 1 is defined as the day of plug), into an appropriate buffer
that will sustain the ES cells in an undifferentiated, pluripotent state. ES cell lines
may be isolated by a number of methods well known to those of skill in the art. For
example, the blastocysts may be allowed to attach to the culture dish and
approximately 7 days later, the outgrowing inner cell mass picked, trypsinized and
transferred to another culture dish in the same culture media. ES cell colonies appear
2-3 weeks later with between 5-7 individual colonies arising from each explanted
inner cell mass. The ES cell lines can then be expanded for further analysis.
Alternatively, ES cell lines can be isolated using the immunosurgery technique
(described in Martin, Proc. Natl. Acad. Sci. USA 78:7634-7638, 1981) where the
trophectoderm cells are destroyed using anti-mouse antibodies prior to explanting the
inner cell mass.

In generating transgenic animals, the ES cell lines that have been
manipulated by homologous recombination are reintroduced into the embryonic
environment by blastocyst injection (as described in Williams et al., Cell 52: 121-131,
1988). Briefly, blastocysts are isolated from a pregnant mouse and expanded. The
expanded blastocysts are maintained in oil-drop cultures at 4°C for 10 minutes prior to
culture. The ES cells are prepared by picking individual colonies, which are then
incubated in phosphate-buffered saline, 0.5 mM EGTA for 5 minutes; a single cell
suspension is prepared by incubation in a trypsin-EDTA solution containing 1% (v/v)
chick serum for a further 5 minutes at 4°C. Five to twenty ES cells (in Dulbecco's
modified Eagle's Medium with 10% (v/v) fetal calf serum and 3,000 units/ml DNAase
I buffered in 20 mM HEPES [pH 8]) are injected into each blastocyst. The
blastocysts are then transferred into pseudo-pregnant recipients and allowed to
develop normally. The transgenic mice are identified by coat markers (Hogan et al.,
Manipulating the Mouse Embryo, Cold Spring Harbor, N.Y. (1986)). Additional
methods of isolating and propagating ES cells may be found in, for example, U.S.
Patent No. 5,166,065; U.S. Patent No. 5,449,620; U.S. Patent No. 5,453,357; U.S.
Patent No. 5,670,372; U.S. Patent No. 5,753,506; U.S. Patent No. 5,985,659, each
incorporated herein by reference.

An alternative method involving zygote injection for making
transgenic animals is described in, for example, U.S. Patent No. 4,736,866,
incorporated herein by reference. Additional methods for producing transgenic
animals are generally described by Wagner and Hoppe (U.S. Patent No. 4,873,191;
which is incorporated herein by reference), Brinster et al. (Proc. Nat'l Acad. Sci. USA,
82(13) 4438-4442, 1985; which is incorporated herein by reference in its entirety) and
in Manipulating the Mouse Embryo; A Laboratory Manual, 2nd edition (eds., Hogan,
Beddington, Costantini and Long, Cold Spring Harbor Laboratory Press, 1994; which
is incorporated herein by reference in its entirety).

Briefly, this method involves injecting DNA into a fertilized egg, or
zygote, and then allowing the egg to develop in a pseudo-pregnant mother. The
zygote can be obtained using male and female animals of the same strain or from
male and female animals of different strains. The transgenic animal that is born, the
founder, is bred to produce more animals with the same DNA insertion. In this
method of making transgenic animals, the new DNA typically randomly integrates
into the genome by a non-homologous recombination event. One to many thousands
of copies of the DNA may integrate at a site in the genome.

Generally, the DNA is injected into one of the pronuclei, usually the
larger male pronucleus. The zygotes are then either transferred the same day, or
cultured overnight to form 2-cell embryos and then transferred into the oviducts of
pseudo-pregnant females. The animals born are screened for the presence of the
desired integrated DNA.

DNA clones for microinjection can be prepared by any means known
in the art. For example, DNA clones for microinjection can be cleaved with enzymes
appropriate for removing the bacterial plasmid sequences, and the DNA fragments
electrophoresed on 1% agarose gels in TBE buffer, using standard techniques. The
DNA bands are visualized by staining with ethidium bromide, and the band
containing the expression sequences is excised. The excised band is then placed in
dialysis bags containing 0.3 M sodium acetate, pH 7.0. DNA is electroeluted into the
dialysis bags, extracted with a 1:1 phenol:chloroform solution and precipitated by two
volumes of ethanol. The DNA is redissolved in 1 ml of low salt buffer (0.2 M NaCl,
20 mM Tris, pH 7.4, and 1 mM EDTA) and purified on an Elutip-D™ column. The
column is first primed with 3 ml of high salt buffer (1 M NaCl, 20 mM Tris, pH 7.4,
and 1 mM EDTA) followed by washing with 5 ml of low salt buffer. The DNA
solutions are passed through the column three times to bind DNA to the column
matrix. After one wash with 3 ml of low salt buffer, the DNA is eluted with 0.4 ml
high salt buffer and precipitated by two volumes of ethanol. DNA concentrations are
measured by absorption at 260 nm in a UV spectrophotometer. For microinjection,
DNA concentrations are adjusted to 3 mg/ml in 5 mM Tris, pH 7.4 and 0.1 mM
EDTA.
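
The measurement and adjustment of DNA concentration described above can be sketched as follows. The sketch assumes the widely used conversion that an absorbance of 1.0 at 260 nm corresponds to about 50 µg/ml of double-stranded DNA in a 1 cm path; that factor, the function names, and the example readings are assumptions introduced here, not values from the disclosure.

```python
# Assumed conversion: A260 of 1.0 ~ 50 ug/ml double-stranded DNA (1 cm path).
DSDNA_UG_PER_ML_PER_A260 = 50.0


def dsdna_conc_ug_per_ml(a260, dilution_factor=1.0):
    """Estimate dsDNA concentration of the undiluted stock from an A260 reading."""
    if a260 < 0 or dilution_factor <= 0:
        raise ValueError("invalid absorbance or dilution factor")
    return a260 * DSDNA_UG_PER_ML_PER_A260 * dilution_factor


def diluent_volume_ul(stock_conc, target_conc, stock_volume_ul):
    """Volume of buffer to add so that the stock reaches target_conc.

    stock_conc and target_conc must share the same units.
    """
    if target_conc <= 0 or target_conc > stock_conc:
        raise ValueError("target must be positive and not exceed the stock")
    final_volume_ul = stock_conc * stock_volume_ul / target_conc
    return final_volume_ul - stock_volume_ul


# Hypothetical reading: A260 = 0.2 on a 10-fold dilution of the eluate.
stock = dsdna_conc_ug_per_ml(0.2, dilution_factor=10.0)
# Buffer needed to bring 10 ul of that stock to a quarter of its concentration.
buffer_ul = diluent_volume_ul(stock, stock / 4.0, 10.0)
```

The diluent here would be the injection buffer named in the text (5 mM Tris, pH 7.4 and 0.1 mM EDTA), with the target concentration supplied in the same units as the measured stock.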

Additional methods for purification of DNA for microinjection are
described in Hogan et al. Manipulating the Mouse Embryo (Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY, 1986), in Palmiter et al. Nature 300:611 (1982);
in The Qiagenologist, Application Protocols, 3rd edition, published by Qiagen, Inc.,
Chatsworth, CA.; and in Sambrook et al. Molecular Cloning: A Laboratory Manual
(Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989).

In an exemplary microinjection procedure, female mice six weeks of
age are induced to superovulate. The superovulating females are placed with males
and allowed to mate. After approximately 21 hours, the mated females are sacrificed
and embryos are recovered from excised oviducts and placed in an appropriate buffer,
e.g., Dulbecco's phosphate buffered saline with 0.5% bovine serum albumin (BSA;
Sigma). Surrounding cumulus cells are removed with hyaluronidase (1 mg/ml).
Pronuclear embryos are then washed and placed in Earle's balanced salt solution
containing 0.5% BSA in a 37.5°C incubator with a humidified atmosphere at 5%
CO2, 95% air until the time of injection. Embryos can be implanted at the two-cell
stage.

Randomly cycling adult female mice are paired with vasectomized
males. C57BL/6 or Swiss mice or other comparable strains can be used for this
purpose. Recipient females are mated at the same time as donor females. At the time
of embryo transfer, the recipient females are anesthetized with an intraperitoneal
injection of 0.015 ml of 2.5% avertin per gram of body weight. The oviducts are
exposed by a single midline dorsal incision. An incision is then made through the
body wall directly over the oviduct. The ovarian bursa is then torn with watchmaker's
forceps. Embryos to be transferred are placed in DPBS (Dulbecco's phosphate
buffered saline) and in the tip of a transfer pipette (about 10 to 12 embryos). The
pipette tip is inserted into the infundibulum and the embryos transferred. After the
transfer, the incision is closed by two sutures. The pregnant animals then give birth to
the founder animals which are used to establish the transgenic line.
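
The weight-based anesthetic volume stated above (0.015 ml of 2.5% avertin per gram of body weight) is simple arithmetic, sketched below. The function name and the example body weight are hypothetical; only the per-gram figure comes from the text.

```python
# From the text: 0.015 ml of 2.5% avertin per gram of body weight.
AVERTIN_ML_PER_GRAM = 0.015


def avertin_volume_ml(body_weight_g):
    """Injection volume (ml) of 2.5% avertin for a recipient of the given weight."""
    if body_weight_g <= 0:
        raise ValueError("body weight must be positive")
    return AVERTIN_ML_PER_GRAM * body_weight_g


# Hypothetical 25 g recipient female.
volume_ml = avertin_volume_ml(25.0)
```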

I. Use of Prox-1-based Compositions for Diagnostic Purposes

The demonstration that Prox-1 is overexpressed in precancerous and
colon cancer cells also indicates that detection of Prox-1 polynucleotides and
polypeptides (including variants thereof) is useful for diagnostic purposes.
Therefore, preferred aspects of the present invention are directed to methods of
screening and diagnosing colon cancer in an individual.

In one preferred embodiment, diagnostic methods of the invention are 
practiced through the detection of the Prox-1 protein. In general, methods for 
detecting a polypeptide of the invention can comprise contacting a biological sample 
with a compound that binds to and forms a complex with the polypeptide for a period 
sufficient to form the complex, and detecting the complex, so that if a complex is
detected, a polypeptide of the invention is detected. Prox-1 protein detection can be 
accomplished using antibodies specific for the protein in any of a number of formats 
commonly used by those of skill in the art for such detection. 

For example, elsewhere in the present application, the production and
characterization of monoclonal antibodies specific for Prox-1 is described. Such
antibodies may be employed in ELISA-based techniques and Western blotting
techniques to detect the presence of Prox-1 in a biological sample from a subject
being tested. Methods for setting up ELISA assays and preparing Western blots of a
sample are well known to those of skill in the art. The biological sample can be any
tissue or fluid in which colon cells or tissue might be present.

An anti-Prox-1 antibody or fragment thereof also is useful to monitor
expression of this protein in individuals suffering from colon cancer. Typically,
diagnostic assays entail detecting the formation of a complex resulting from the
binding of an antibody or fragment thereof to Prox-1. For diagnostic purposes, the
antibodies or antigen-binding fragments can be labeled or unlabeled. The antibodies
or fragments can be directly labeled. A variety of labels can be employed, including,
but not limited to, radionuclides, fluorescers, enzymes, enzyme substrates, enzyme
cofactors, enzyme inhibitors and ligands (e.g., biotin, haptens). Numerous
appropriate immunoassays are known to the skilled artisan (see, for example, U.S.
Pat. Nos. 3,817,827; 3,850,752; 3,901,654 and 4,098,876). When unlabeled, the
antibodies or fragments can be detected using suitable means, as in agglutination
assays, for example. Unlabeled antibodies or fragments can also be used in
combination with another (i.e., one or more) suitable reagent which can be used to
detect antibody, such as a labeled antibody (e.g., a second antibody) reactive with the
first antibody (e.g., anti-idiotype antibodies or other antibodies that are specific for the
unlabeled immunoglobulin) or other suitable reagent (e.g., labeled protein A).

In one embodiment, the antibodies or fragments of the present
invention can be utilized in enzyme immunoassays, wherein the subject antibody or
fragment, or second antibodies, are conjugated to an enzyme. When a biological
sample comprising a Prox-1 protein is combined with the subject antibodies, binding
occurs between the antibodies and the Prox-1 protein. In one embodiment, a
biological sample containing cells expressing a mammalian Prox-1 protein, or
biological fluid containing secreted Prox-1 is combined with the subject antibodies,
and binding occurs between the antibodies and the Prox-1 protein present in the
biological sample comprising an epitope recognized by the antibody. The bound
protein can be separated from unbound reagents and the presence of the
antibody-enzyme conjugate specifically bound to the Prox-1 protein can be determined, for
example, by contacting the sample with a substrate of the enzyme which produces a
color or other detectable change when acted on by the enzyme. In another
embodiment, the subject antibodies can be unlabeled, and a second, labeled antibody
can be added which recognizes the subject antibody.

Similarly, the present invention also relates to a method of detecting
and/or quantitating expression of a mammalian Prox-1 protein or a portion of the
Prox-1 protein by a cell, in which a composition comprising a cell or fraction thereof
(e.g., a soluble fraction) is contacted with an antibody or functional fragment thereof
which binds to a mammalian Prox-1 protein or a portion of the Prox-1 protein under
conditions appropriate for binding of the antibody or fragment thereto, and binding is
monitored. Detection of the antibody, indicative of the formation of a complex
between the antibody and the protein or a portion of the protein, indicates the presence
of the protein.

The method can be used to detect expression of Prox-1 from the cells
of an individual (e.g., in a sample, such as a body fluid, such as blood, saliva or other
suitable sample). The level of expression of Prox-1 in a biological sample of that individual
can also be determined, for instance, by flow cytometry, and the level of expression
(e.g., staining intensity) can be correlated with disease susceptibility, progression or
risk.

In certain other diagnostic embodiments, the polynucleotide sequences
encoding Prox-1 protein may be used for the diagnosis of conditions or diseases with
which the expression of Prox-1 protein is associated. In general, methods for
detecting Prox-1 mRNA can comprise contacting a biological sample with a
compound that binds to and forms a complex with Prox-1 mRNA for a period
sufficient to form the complex, and detecting the complex in a quantitative or
semi-quantitative way. Such methods can also comprise amplification techniques
involving contacting a biological sample with nucleic acid primers that anneal to
Prox-1 mRNA or its complement, and amplifying annealed polynucleotides, so that if
a polynucleotide is amplified, a polynucleotide of the invention is detected. The
biological sample can be any tissue or fluid in which Prox-1-expressing colon cells
might be present.

In the amplification procedures, polynucleotide sequences encoding
Prox-1 protein may be used in hybridization or PCR assays of fluids or tissues from
biopsies to detect Prox-1 protein expression. Such methods may be qualitative or
quantitative in nature and may include Southern or northern analysis, dot blot or other
membrane-based technologies; PCR technologies; dip stick, pin, chip and ELISA
technologies. All of these techniques are well known in the art and are the basis of
many commercially available diagnostic kits.

One such procedure known in the art is quantitative real-time PCR. 
Real-time quantitative PCR can be conveniently accomplished using the commercially 
available ABI PRISM™ 7700 Sequence Detection System, available from PE-Applied 
Biosystems, Foster City, CA, and used according to the manufacturer's 
instructions. PCR reagents can be obtained from PE-Applied Biosystems, Foster 
City, CA. Gene target quantities obtained by real-time RT-PCR may be normalized 
either to the expression level of GAPDH, a gene whose expression is constant, or 
to total RNA quantified using RiboGreen™ (Molecular Probes, Inc., Eugene, OR). 
GAPDH expression is quantified by real-time RT-PCR, either run simultaneously 
with the target (multiplexing) or separately. Total RNA is quantified using 
RiboGreen™ RNA quantification reagent from Molecular Probes. Methods of RNA 
quantification by RiboGreen™ are taught in Jones, L.J., et al., Analytical 
Biochemistry, 1998, 265, 368-374. Controls are analyzed in parallel to verify the 
absence of DNA in the RNA preparation (-RT control) as well as the absence of 
primer dimers in control samples lacking template RNA. In addition, RT-PCR 
products may be analyzed by gel electrophoresis. 
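
By way of illustration only, the GAPDH normalization described above is often carried out as a comparative threshold-cycle (Ct) calculation. The following Python sketch is not taken from the present disclosure: it assumes approximately 100% amplification efficiency (one doubling per cycle), and the function name and the Ct values are hypothetical.

```python
def relative_expression(ct_target_sample, ct_gapdh_sample,
                        ct_target_control, ct_gapdh_control):
    """Fold change of a target in a sample versus a control, normalized to GAPDH.

    Assumes one doubling per PCR cycle (an idealization).
    """
    # Normalize each target Ct to the GAPDH Ct measured in the same sample.
    delta_sample = ct_target_sample - ct_gapdh_sample
    delta_control = ct_target_control - ct_gapdh_control
    # A lower normalized Ct means more starting template, hence the minus sign.
    return 2.0 ** -(delta_sample - delta_control)


# Hypothetical Ct values: Prox-1 and GAPDH in tumor and in matched normal tissue.
fold = relative_expression(24.0, 18.0, 27.0, 18.0)
print(f"Prox-1 in tumor relative to normal: {fold:.1f}-fold")
```

Normalizing to total RNA quantified with RiboGreen™ instead would replace the GAPDH terms with an input-amount correction.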

A reverse transcriptase PCR™ amplification procedure may be 
performed in order to quantify the amount of mRNA amplified. Methods of reverse 
transcribing RNA into cDNA are well known and described in Sambrook et al., 1989. 
Alternative methods for reverse transcription utilize thermostable DNA polymerases. 
These methods are described in WO 90/07641, filed December 21, 1990. 

Conditions for incubating a nucleic acid probe or antibody with a test 
sample vary. Incubation conditions depend on the format employed in the assay, the 
detection methods employed, and the type and nature of the nucleic acid probe or 
antibody used in the assay. One skilled in the art will recognize that any one of the 
commonly available hybridization, amplification or immunological assay formats can 
readily be adapted to employ the nucleic acid probes or antibodies of the present 
invention. Examples of such assays can be found in Chard, T., An Introduction to 
Radioimmunoassay and Related Techniques, Elsevier Science Publishers, 
Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in 
Immunocytochemistry, Academic Press, Orlando, FL, Vol. 1 (1982), Vol. 2 (1983), 
Vol. 3 (1985); Tijssen, P., Practice and Theory of Immunoassays: Laboratory 
Techniques in Biochemistry and Molecular Biology, Elsevier Science Publishers, 
Amsterdam, The Netherlands (1985). The test samples of the present invention include cells, 
protein extracts of cells, or biological fluids such as blood, serum, and plasma. The 
test sample used in the above-described method will vary based on the assay format, 
the nature of the detection method, and the tissues, cells or extracts used as the sample to 
be assayed. Methods for preparing protein extracts or membrane extracts of cells are 
well known in the art and can readily be adapted in order to obtain a sample which 
is compatible with the system utilized. 

In addition, such assays may be useful in evaluating the efficacy of a 
particular therapeutic treatment regime in animal studies, in clinical trials, or in 
monitoring the treatment of an individual patient. In order to provide a basis for the 
diagnosis of disease, a normal or standard measurement of Prox-1 mRNA or protein 
expression is established. This generally involves Prox-1 measurements from healthy 
colon tissue taken from one or more subjects, measured using the same or similar 
reagents used for the test subjects. The healthy subject preferably is matched for sex 
and age, and optionally, ethnicity. Deviation between standard and subject values 
correlates with the presence of precancerous or cancerous tissue. 
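
As a purely illustrative sketch of how a deviation between standard and subject values might be expressed, the following Python fragment computes a z-score of a subject measurement against a set of healthy reference measurements. The reference values, the subject value and the cutoff of 2.0 are hypothetical assumptions chosen for the example and are not taught by the present disclosure.

```python
import statistics


def z_score(subject_value, reference_values):
    """Number of reference standard deviations separating subject from reference mean."""
    mean = statistics.mean(reference_values)
    sd = statistics.stdev(reference_values)  # sample standard deviation
    return (subject_value - mean) / sd


def deviates_from_standard(subject_value, reference_values, cutoff=2.0):
    """True if the subject value lies more than `cutoff` SDs from the reference mean.

    The cutoff is an arbitrary, hypothetical choice.
    """
    return abs(z_score(subject_value, reference_values)) > cutoff


# Hypothetical normalized Prox-1 levels from healthy colon tissue and one test subject.
reference = [1.0, 1.2, 0.8, 1.1, 0.9]
subject = 1.5
print(round(z_score(subject, reference), 2), deviates_from_standard(subject, reference))
```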

Once disease is established, a therapeutic agent is administered and a 
treatment profile is generated. Such assays may be repeated on a regular basis to 
evaluate whether the values in the profile progress toward or return to the normal or 
standard pattern. Successive treatment profiles may be used to show the efficacy of 
treatment over a period of several days or several months. 

Methods to quantify the expression of a particular molecule include 
radiolabeling (Melby et al., J Immunol Methods 159: 235-44, 1993) or biotinylating 
(Duplaa et al., Anal Biochem 229-36, 1993) nucleotides, coamplification of a control 
nucleic acid, and standard curves onto which the experimental results are interpolated. 

In addition to being used as diagnostic methods, screening methods 
also may be used in a prognostic manner to monitor the efficacy of treatment. The 
methods may be performed immediately before, during and after treatment to monitor 
treatment success. The methods also should be performed at intervals, preferably 
every three to six months, on disease-free patients to ensure treatment success. 
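
The interpolation of experimental results onto a standard curve, mentioned above among the quantification methods, can be sketched as follows. This Python example is illustrative only: it assumes a signal (here a threshold cycle) that is linear in the logarithm of template quantity, and the standard quantities, signal values and function names are hypothetical.

```python
import math


def fit_standard_curve(quantities, signals):
    """Least-squares line: signal = slope * log10(quantity) + intercept."""
    xs = [math.log10(q) for q in quantities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(signals) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, signals))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def interpolate_quantity(signal, slope, intercept):
    """Invert the fitted line to estimate the quantity giving `signal`."""
    return 10 ** ((signal - intercept) / slope)


# Hypothetical dilution series of a control nucleic acid (copies) and measured Ct values.
standards = [1e2, 1e3, 1e4, 1e5]
cts = [30.0, 26.7, 23.4, 20.1]
slope, intercept = fit_standard_curve(standards, cts)
print(f"estimated copies at Ct 25.05: {interpolate_quantity(25.05, slope, intercept):.0f}")
```

Results should only be read off within the range spanned by the standards; extrapolation beyond the dilution series is not supported by the fit.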

In another embodiment of the present invention, kits are provided 
which contain the necessary reagents to carry out the assays of the present invention. 
Specifically, the invention provides a compartment kit to receive, in close 
confinement, one or more containers, which comprises: (a) a first container 
comprising one of the probes or antibodies of the present invention; and (b) one or 
more other containers comprising one or more of the following: wash reagents and 
reagents capable of detecting the presence of a bound probe or antibody. 

In detail, a compartment kit includes any kit in which reagents are 
contained in separate containers. Such containers include small glass containers, 
plastic containers, or strips of plastic or paper. Such containers allow one to 
efficiently transfer reagents from one compartment to another compartment such that 
the biological sample and reagents are not cross-contaminated, and the agents or 
solutions of each container can be added in a quantitative fashion from one 
compartment to another. Such containers will include a container which will accept 
the test sample, a container which contains, for example, the antibodies used in the 
assay, containers which contain wash reagents (such as phosphate buffered saline, 
Tris-buffers, etc.), and containers which contain the reagents used to detect the bound 
antibody or probe. Types of detection reagents include labeled nucleic acid probes, 
labeled secondary antibodies, or in the alternative, if the primary antibody is labeled, 
the enzymatic or antibody binding reagents which are capable of reacting with the 
labeled antibody. One skilled in the art will readily recognize that the disclosed 
probes and antibodies of the present invention can be readily incorporated into one of 
the established kit formats which are well known in the art. 

In further detail, kits for use in detecting the presence of a mammalian 
Prox-1 protein can include an antibody or functional fragment thereof which binds to 
a mammalian Prox-1 protein or portion of this protein, as well as one or more 
ancillary reagents suitable for detecting the presence of a complex between the 
antibody or fragment and Prox-1 or portion thereof. The antibody compositions of the 
present invention can be provided in lyophilized form, either alone or in combination 
with additional antibodies specific for other epitopes. The antibodies, which can be 
labeled or unlabeled, can be included in the kits with adjunct ingredients. For 
example, the antibodies can be provided as a lyophilized mixture with the adjunct 
ingredients, or the adjunct ingredients can be separately provided for combination by 
the user. Generally these adjunct materials will be present in less than about 5% 
weight based on the amount of active antibody, and usually will be present in a total 
amount of at least about 0.001% weight based on antibody concentration. Where a 
second antibody capable of binding to the monoclonal antibody is employed, such 
antibody can be provided in the kit, for instance in a separate vial or container. The 
second antibody, if present, is typically labeled, and can be formulated in an 
analogous manner with the antibody formulations described above. 

J. Examples 

The present invention is illustrated in the following examples, which 
are intended to be illustrative and not limiting. Upon consideration of the present 
disclosure, one of skill in the art will appreciate that many other embodiments and 
variations may be made in the scope of the present invention. 

Example 1 provides methods and materials for the subsequent 
Examples. 

Example 2 provides experimental results of studies designed to assess 
Prox-1 expression in colorectal cancer cells. 

Example 3 details expression of Prox-1 in round but not in adherent 
subclones of the SW480 colon adenocarcinoma cell line. 

Example 4 provides experimental results of Prox-1 silencing in 
SW480R cells. 

Example 5 describes effects of Prox-1 ablation on Notch signaling in 
SW480R cells. 

Example 6 describes the effects of suppression of Prox-1 on the growth 
of SW480R cells in soft agar. 

Example 7 describes the effect of Prox-1 suppression on prostaglandin 
biosynthesis. 

Example 8 describes experiments aimed at assessing the effects of 
altered Notch signaling. 

Example 9 describes experiments aimed at assessing the effects of 
Prox-1 suppression on the growth of SW480R tumors in nude mice. 

Example 10 describes analysis of Prox-1 in natural colorectal tumors. 

Example 11 describes one method for diagnosing or screening for 
colorectal cancer. 

Example 12 describes experiments designed to compare Prox-1 
expression in normal colonic epithelium. 

Example 13 describes experiments aimed at assessing Prox-1 
expression in Apc Min/+ mice. 

Example 14 describes studies conducted using the SW480R cell line as an 
in vitro model to investigate the role of Prox-1 in colorectal carcinoma. 

Example 15 describes experiments to characterize the effects of Prox-1 
suppression and overexpression in colorectal cancer. 

Example 16 describes experiments employing dominant negative 
mutants of Prox-1. 

EXAMPLE 1 
METHODS AND MATERIALS 

Methods and materials used or referred to in subsequent examples are 
set forth directly below. 

Antibodies 

Monoclonal mouse anti-vimentin, β-catenin (Transduction 
Laboratories), Ki-67 (Pharmingen) and chromogranin A (Ab-3, NeoMarkers), 
monoclonal rat anti-BrdU (Harlan Seralab) and polyclonal rabbit anti-Prox-1 were 
obtained from the indicated commercial sources. The fluorochrome-conjugated 
secondary antibodies were obtained from Jackson Immunoresearch. 

For production of Prox-1 antibodies, cDNA encoding the Prox-1 
homeobox domain and prospero domain (amino acids 578-750 of human Prox-1, SEQ 
ID NO: 3) was subcloned into the pGEX2t vector to produce a GST-Prox-1 fusion 
construct. This construct was expressed in E. coli and the GST-Prox-1 fusion protein 
from E. coli was purified using glutathione Sepharose according to the manufacturer's 
instructions (Amersham, Piscataway, NJ). Fusion protein was used to immunize 
rabbits according to a standard protocol. Prox-1-specific antibodies were isolated 
from rabbit serum using sequential columns with GST and GST-Prox-1 coupled to 
vinylsulfone agarose resin (Sigma). Purified antibody recognized an 85 kD protein in 
lysates from 293T cells transfected with Prox-1 but not from cells transfected with the 
empty vector. 

Synthetic siRNAs 

siRNA duplexes were prepared from synthetic 21 nucleotide RNAs 
(Dharmacon Research). siRNA sequences were: 
5'-CUGCAAGCUGGAUAGUGAAGU-3' (Prox-1 siRNA A16 sense) (SEQ ID NO: 4); 
5'-UUCACUAUCCAGCUUGCAGAU-3' (Prox-1 siRNA A16 antisense) (SEQ ID 
NO: 5); 5'-CUAUGAGCCAGUUUGAUAUUU-3' (Prox-1 siRNA A25 sense) (SEQ 
ID NO: 6); 5'-AUAUCAAACUGGCUCAUAGUU-3' (Prox-1 siRNA A25 
antisense) (SEQ ID NO: 7). 

EGFP-targeting control siRNA A18 was essentially as described 
(Lewis et al., 2002) except that uracil 3' overhangs were used instead of 
thymidine 3' overhangs: GACGUAAACGGCCACAAGUUU (EGFP siRNA A18 sense) (SEQ ID 
NO: 8); ACUUGUGGCCGUUUACGUCUU (EGFP siRNA A18 antisense) (SEQ ID 
NO: 9). 
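
As an illustrative consistency check on the sequences listed above, the following Python sketch verifies that each sense and antisense strand can pair over 19 nucleotides, leaving a 2-nucleotide 3' overhang on each strand. The sequences are those of SEQ ID NOs: 4-9; the helper names and the assumption of a 2-nucleotide overhang are introduced here for the example only.

```python
# Watson-Crick pairing for RNA bases.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Reverse complement of an RNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def forms_duplex(sense, antisense, overhang=2):
    """True if the strands pair fully once each 3' overhang is set aside."""
    core_sense = sense[:-overhang]
    core_antisense = antisense[:-overhang]
    return reverse_complement(core_sense) == core_antisense


# Sense/antisense pairs as listed in the text (SEQ ID NOs: 4-9).
PAIRS = {
    "Prox-1 siRNA A16": ("CUGCAAGCUGGAUAGUGAAGU", "UUCACUAUCCAGCUUGCAGAU"),
    "Prox-1 siRNA A25": ("CUAUGAGCCAGUUUGAUAUUU", "AUAUCAAACUGGCUCAUAGUU"),
    "EGFP siRNA A18": ("GACGUAAACGGCCACAAGUUU", "ACUUGUGGCCGUUUACGUCUU"),
}

for name, (sense, antisense) in PAIRS.items():
    print(name, len(sense), len(antisense), forms_duplex(sense, antisense))
```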

siRNAs were 2'-ACE deprotected according to the manufacturer's 
instructions, dried in vacuum, resuspended in 400 µl water, dried again, resuspended 
in water, and annealed to form duplex siRNAs. For annealing, equimolar amounts of 
siRNA strands (approximately 50-100 µM) were incubated in annealing buffer 
(100 mM potassium acetate, 30 mM Hepes-KOH pH 7.4, 2 mM magnesium acetate) for 
5 min at +95°C followed by 30 min at +37°C and 30 min at +25°C. After annealing, 
the siRNA concentration was measured by spectrometry and the siRNA was aliquoted and 
stored at -20°C. 

Cell culture, transfection, and soft agar assay 

SW480 cells were obtained from ATCC (CCL-228) and cultured in 
RPMI-1640 supplemented with 10% fetal bovine serum, 1 mM glutamine and 
antibiotics. HepG2 cells were cultured in DMEM containing 10% fetal bovine serum, 
1 mM glutamine and antibiotics. 

Transfection of siRNAs was carried out using Lipofectamine 2000 
(Invitrogen) according to the manufacturer's instructions using 0.5% (v/v) Lipofectamine 
2000 reagent for SW480R and 0.4% (v/v) Lipofectamine 2000 for adherent SW480 
cells and either 20 nM or 100 nM (f.c.) of siRNA. Transfections were carried out in 
antibiotic-free media for 4-6 hours before changing cells back to normal culture 
media. For long-term experiments, siRNA transfections were repeated 48-72 h 
after the previous transfection (at the protein level the silencing effect was seen to remain 
efficient for at least 96 h). Normally approximately 90-95% transfection efficiency 
was achieved. Opti-MEM (Invitrogen) medium was used in preparation of 
transfection mixtures. 
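
The final concentrations given above translate into pipetting volumes by simple dilution arithmetic (C1 x V1 = C2 x V2). The following Python sketch is illustrative only; the 20 µM duplex stock concentration and the 1 ml transfection volume are hypothetical assumptions, since the text specifies only the final concentrations.

```python
def sirna_stock_volume_ul(stock_um, final_nm, final_volume_ul):
    """Volume of siRNA stock (µl) giving `final_nm` in `final_volume_ul` (C1*V1 = C2*V2)."""
    final_um = final_nm / 1000.0  # convert nM to µM
    return final_um * final_volume_ul / stock_um


def reagent_volume_ul(percent_v_v, final_volume_ul):
    """Volume of a reagent (µl) for a given % (v/v) in the final volume."""
    return percent_v_v / 100.0 * final_volume_ul


# Hypothetical setup: 20 µM duplex stock, 1 ml final transfection volume.
STOCK_UM = 20.0
VOLUME_UL = 1000.0
for final_nm in (20.0, 100.0):
    print(f"{final_nm:.0f} nM siRNA: {sirna_stock_volume_ul(STOCK_UM, final_nm, VOLUME_UL):.1f} µl stock")
for cells, percent in (("SW480R", 0.5), ("adherent SW480", 0.4)):
    print(f"{cells}: {reagent_volume_ul(percent, VOLUME_UL):.1f} µl Lipofectamine 2000")
```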

For luciferase assays, cells were transfected with Green Fluorescent 
Protein small interfering RNA (GFP siRNA) or Prox-1 siRNAs 72 h prior to the 
transfection with the firefly luciferase reporter constructs CBF1-luc, control pGL2-luc 
(Promega), TOPFlash and FOPFlash (Upstate). To normalize the transfection 
efficiency, cells were co-transfected with the Renilla luciferase reporter pRL-TK 
(Promega). 36 h after the last transfection, cells were lysed and lysates were analyzed 
for luciferase activity using the Dual-Luciferase™ kit according to the manufacturer's 
instructions (Promega). 

For the soft agar assay, 2×10³ and 2×10⁴ cells were seeded in triplicate in 1 
ml of 0.33% (w/v) agar (Difco) containing D-MEM, 10% fetal bovine serum, 1 mM 
glutamine and antibiotics in 6-well plates containing 1 ml of 0.5% bottom agar layer. 
Cells were fed twice a week, and the number of colonies per plate was scored after two 
weeks in culture. 

RNA isolation, Northern, and Western blotting 

Total RNA was isolated and DNase I treated in RNeasy columns 
(Qiagen). For Cancer Array analysis, filters were hybridized in ExpressHyb with 
32P-labeled probes for LYVE-1 and Prox-1 according to the manufacturer's 
instructions (Clontech). For Northern analysis, the blots were hybridized in Ultrahyb 
solution (Ambion) with 32P-labeled probes produced by RT-PCR using RNA from 
SW480R or SW480A cells. The primers were designed to amplify 300-700 bp of the 
coding sequence, and all PCR fragments were sequenced to confirm their identity. 

For the Affymetrix® gene expression analysis, sample preparations and 
hybridizations were carried out as described (Petrova et al., Embo J. 21: 4593-9, 2002), 
using RNA extracted from two clones of SW480R or SW480A cells, or from two 
independent transfections of two different clones of SW480R cells with GFP siRNA 
or Prox-1 siRNA A16. To confirm the latter results, another transfection was carried 
out using Prox-1 siRNA A25. To exclude non-specific effects due to the 
transfection itself, non-transfected SW480R cells grown in parallel were also 
analyzed. 

For Western blotting, 2×10⁵ cells were lysed in 500 µl of sample buffer, 
lysates were separated using 10% PAGE and transferred to nitrocellulose 
membranes (Schleicher & Schuell) using the semi-dry transfer method for 1 h at 300 mA. 
Membranes were blocked in 5% non-fat dry milk, 0.1% Tween-20 in 50 mM Tris-HCl, 
pH 7.4, 150 mM NaCl, and incubated overnight with primary antibodies. Bound 
primary antibodies were detected using the corresponding HRP-conjugated secondary 
antibodies and the ECL detection method (KPL). 

Immunofluorescence and immunohistochemistry 

The cells were cultured on coverslips, fixed with methanol and stained 
with the primary antibodies and fluorochrome-conjugated secondary antibody. 
F-actin was stained using TexasRed-conjugated phalloidin (Molecular Probes). Cells 
were counterstained with Hoechst 33258 fluorochrome (Sigma) and viewed in a Zeiss 
Axioplan 2 fluorescent microscope. 

For tissue staining, colon tumors and normal colon samples 
were embedded in Tissue-Tek® (Sakura), frozen and sectioned. The 4 µm sections 
were fixed in cold methanol for 10 min and stained with the primary antibodies 
followed by peroxidase staining using the Vectastain Elite ABC kit (Vector Laboratories) 
and 3-amino-9-ethyl carbazole (Sigma), or by detection using fluorochrome-conjugated 
secondary antibodies. 

EXAMPLE 2 

Prox-1 mRNA is Elevated in Colorectal Tumors 

Experiments were conducted to assess the expression of Prox-1 mRNA 
in human cancers using a cancer gene profiling array filter, which contains cDNAs 
from about 250 human cancers and corresponding normal control tissues. Prox-1 
mRNA was significantly increased in 35 out of 53 samples of colorectal cancers. In 
contrast, an increase was seen only rarely or not at all in samples from breast, 
uterine, lung, kidney, ovarian, or thyroid tumors (Fig. 1A, B, and C). Probes for 
Prox-1 (Fig. 1A) and the lymphatic endothelial marker LYVE-1 (Fig. 1B) were used. 
Fig. 1C shows quantification of the dot blot in Fig. 1A, the asterisk indicating 
tumor samples in which Prox-1 expression is significantly different from that of the 
normal tissue (P<0.005). Expression of Prox-1 was low or absent in all kidney 
cancer samples studied. Prox-1 is a marker for lymphatic vessels, which are abundant 
both in normal colonic submucosa and around colon carcinomas (White et al., Cancer 
Res. 62: 1669-75 (2002)). Therefore, the filter was also hybridized to the probe for the 
lymphatic endothelial hyaluronan receptor LYVE-1. Unlike Prox-1, LYVE-1 
levels were higher in the normal samples, suggesting that the increased Prox-1 expression 
cannot be attributed to the lymphatic vessels (Fig. 1B). 

Experiments were further conducted to assess the expression of Prox-1 
in colon cancers and premalignant colonic lesions using affinity-purified antibodies 
raised against the Prox-1 homeobox and prospero domains, which are conserved between 
the mouse and human proteins. Staining of a panel of mouse tissues and E12.5 and 
E17.5 embryos revealed specific nuclear staining for Prox-1 in the previously reported 
sites of expression, such as lymphatic vessels, lens fiber cells and a subset of 
neurons in the neural tube. Staining of eleven human colorectal adenomas and nine 
carcinomas and adjacent normal mucosa revealed increased expression of Prox-1 in 
nine adenomas and in six carcinomas (Fig. 2A-I). Increased Prox-1 staining was 
observed in all cells in seven adenomas and in two carcinomas, whereas in the other 
lesions a heterogeneous expression of Prox-1 occurred. In one tumor sample, no 
specific staining for Prox-1 was seen, while strong expression was observed in 
intratumoral lymphatic vessels. 

Double immunofluorescent staining for Prox-1 and the neuroendocrine 
marker chromogranin A or the proliferation marker Ki-67 was conducted in normal 
colonic epithelial cells. Nuclei were visualized with Hoechst 33342. In the normal 
colonic mucosa, Prox-1 was strongly expressed in some epithelial cells, a subset of 
which was positive for the pan-neuroendocrine marker chromogranin A. In addition, 
a weaker but significant Prox-1 expression was observed in the bottom of the crypts 
below the cell proliferation zone identified by staining for the Ki-67 antigen. The 
location of Prox-1 positive cells at the base of the crypts corresponds to the position 
of the intestinal stem cells (Bach et al., Carcinogenesis 21: 469-76 (2000)). 

EXAMPLE 3 

PROX-1 IS EXPRESSED IN ROUND BUT NOT IN ADHERENT SUBCLONES 
OF THE SW480 COLON ADENOCARCINOMA CELL LINE. 

Additional studies were conducted to compare Prox-1 expression in 
various cells. No Prox-1 expression was seen in the majority of tumor cell lines 
studied. However, Prox-1 mRNA was present in the hepatocellular carcinoma cell 
line HepG2 and the colon carcinoma cell line SW480. BEC (blood endothelial cells), 
CAEC (coronary artery endothelial cells), and LEC (lymphatic endothelial cells) served 
as negative and positive controls. Immunofluorescent staining of Prox-1 revealed 
strong expression in all HepG2 cells, whereas only a subset of SW480 cells were 
Prox-1 positive. Double immunofluorescent staining for Prox-1 and for β-catenin or for 
the F-actin marker phalloidin demonstrated that Prox-1 expression is restricted to 
weakly adherent round SW480 cells, which did not display focal adhesions or actin 
stress fibers, and that Prox-1 was very weakly expressed in the adherent cells. The 
existence of two subtypes of cells in the SW480 cultures has been reported previously 
(Palmer, H. G. et al., J Cell Biol. 154: 369-87, 2001; Tomita, N. et al., Cancer Res. 
52: 6840-7, 1992). The SW480R (round) cells displayed anchorage-independent 
growth in vitro and a highly malignant phenotype in vivo, whereas the SW480A 
(adherent) cells did not grow well in soft agar and formed small and well-differentiated 
tumors when implanted into nude mice. 

Several SW480R and SW480A clones were isolated, which could be 
continuously grown for at least 20 passages without conversion of phenotypes. 
SW480R and SW480A cells differed in their levels of Prox-1, as determined by 
Northern and Western blotting, with much higher expression in the round cells and 
weak, if any, expression in the adherent ones. The gene expression profiles of 
SW480R and SW480A cells were compared using oligonucleotide microarrays 
containing 22,000 annotated human genes, which identified about 1,000 genes whose 
expression differed by more than fourfold between these two cell types (Table I). 
SW480 cells were stained for the intermediate filament protein vimentin and Prox-1. 
Northern blotting and hybridization were used for transcripts. Hybridization for 
GAPDH was used as a control. A striking difference was observed in the expression 
of cytoskeletal and cell adhesion proteins. In agreement with their decreased 
adhesion and round cell shape, the SW480R cells lacked many components of the 
actin, intermediate filament and microtubule networks, such as gelsolin, filamins A 
and B, ezrin, moesin, vimentin, various integrins, and tubulins (Table I). These cells 
expressed higher levels of the proto-oncogene c-met, as well as the receptor tyrosine 
kinase FGFR-4, which has been associated with malignant transformation in 
colorectal and other cancers (Bange, J. et al., Cancer Res. 62: 840-7, 2002; Cavallaro, 
U., Niedermeyer, J., Fuxa, M. & Christofori, G., Nat. Cell Biol. 3: 650-7, 2001; 
Yamada, S. M. et al., Neurol Res. 24: 244-8, 2002), and low levels of the tumor 
suppressor p21Cip1. FGFR-4 is a target for therapeutic intervention according to the 
invention, alone or in combination with Prox-1. Intervention using the same classes 
of inhibitors as described for Prox-1, as well as antibodies and antibody fragment 
substances, is specifically contemplated. In addition, all three tissue inhibitors of 
matrix metalloproteinases were absent from the SW480R cells, which may further 
account for their increased tumor growth in vivo. In contrast, the SW480A cells 
expressed higher levels of the chemokine receptor CXCR4, which is expressed in the 
normal colonic epithelium (Jordan et al., J Clin Invest 104: 1061-9, 1999). In 
summary, the gene expression profile of the SW480R cells correlates well with a 
highly aggressive transformed phenotype, whereas the SW480A cells display more 
differentiated features typical of cells in the colonic crypts. 

Table I. Examples of groups of genes differentially expressed in round versus 
adherent SW480 clones. Two round and two adherent clones were analyzed. 



Gene function and name                                  UniGene    Gene      Log2 ratio,  St. dev
                                                        cluster    symbol    average

1. Cytoskeleton and adhesion
collagen, type XIII, alpha 1                            Hs.211933  COL13A1   -5.6         0.9
fibronectin 1                                           Hs.287820  FN1       -5.2         0.5
integrin, alpha 7                                       Hs.74369   ITGA7     -4.3         0.3
vimentin                                                Hs.297753  VIM       -4.1         0.6
filamin B, beta (actin binding protein 278)             Hs.81008   FLNB      -3.8         0.7
integrin, beta 5                                        Hs.149846  ITGB5     -3.6         0.5
tubulin, beta polypeptide                               Hs.274398  TUBB      -3.3         0.7
PTPL1-associated RhoGAP 1                               Hs.70983   PARG1     -3.0         0.5
collagen, type IX, alpha 3                              Hs.53563   COL9A3    -2.8         0.8
paralemmin                                              Hs.78482   PALM      -2.7         0.2
PDZ and LIM domain 1 (elfin)                            Hs.75807   PDLIM1    -2.7         0.2
cadherin 11, type 2, OB-cadherin (osteoblast)           Hs.75929   CDH11     -2.6         0.7
myosin IC                                               Hs.286226  MYO1C     -2.6         0.6
integrin, alpha 3                                       Hs.265829  ITGA3     -2.6         0.4
discs, large (Drosophila) homolog 1                     Hs.154294  DLG1      -2.5         0.1
integrin, alpha V                                       Hs.295726  ITGAV     -2.5         0.3
CDC42 effector protein (Rho GTPase binding) 3           Hs.260024  CDC42EP3  -2.4         0.4
ephrin-B1                                               Hs.144700  EFNB1     -2.3         0.4
FERM, RhoGEF (ARHGEF) and pleckstrin domain protein 1   Hs.183738  FARP1     -2.3         0.4
myosin ID                                               Hs.39871   MYO1D     -2.1         0.2
PDZ and LIM domain 2 (mystique)                         Hs.379109  PDLIM2    -2.1         0.4
tubulin beta-5                                          Hs.274398  TUBB-5    -1.9         0.3
erythrocyte membrane protein band 4.1-like 1            Hs.26395   EPB41L1   -1.9         0.1
gelsolin (amyloidosis, Finnish type)                    Hs.290070  GSN       -1.9         0.3
laminin, gamma 1 (formerly LAMB2)                       Hs.432855  LAMC1     -1.8         0.1
ras homolog gene family, member E                       Hs.6838    ARHE      -1.7         0.2
IQ motif containing GTPase activating protein 1         Hs.1742    IQGAP1    -1.7         0.3
tight junction protein 1 (zona occludens 1)             Hs.74614   TJP1      -1.7         0.4
catenin (cadherin-associated protein), alpha-like 1     Hs.58488   CTNNAL1   -1.7         0.6
collagen, type XVIII, alpha 1                           Hs.78409   COL18A1   -1.6         0.1
filamin A, alpha (actin binding protein 280)            Hs.195464  FLNA      -1.6         0.2
actin related protein 2/3 complex, subunit 1A, 41kDa    Hs.90370   ARPC1A    -1.5         0.3
alpha integrin binding protein 63                                  AIBP63    -1.4         0.3
spectrin, alpha, non-erythrocytic 1 (alpha-fodrin)      Hs.77196   SPTAN1    -1.4         0.2
villin 2 (ezrin)                                        Hs.155191  VIL2      -1.4         0.3
actin related protein 2/3 complex, subunit 1B, 41kDa    Hs.433506  ARPC1B    -1.3         0.1
plakophilin 4                                           Hs.152151  PKP4      -1.3         0.3
ras homolog gene family, member C                       Hs.179735  ARHC      -1.1         0.1
moesin                                                  Hs.170328  MSN       -1.1         0.1
myristoylated alanine-rich protein kinase C substrate   Hs.75607   MARCKS    -1.1         0.2

2. Tumor growth and invasion
tissue inhibitor of metalloproteinase 2                 Hs.6441    TIMP2     -2.3         0.21
tissue inhibitor of metalloproteinase 3                 Hs.245188  TIMP3     -1.5         0.14
Cyclin-dependent kinase inhibitor 1A (p21, Cip1)        Hs.179665  CDKN1A    -2.5         0
tissue inhibitor of metalloproteinase 1                 Hs.5831    TIMP1     -1.5         0.4
met proto-oncogene (hepatocyte growth factor receptor)  Hs.316752  MET        2.6         0.46
Fibroblast growth receptor 4                            Hs.165950  FGFR4      3.9         0.76

3. Expressed in normal intestinal epithelium
CXCR4                                                   Hs.89414
solute carrier family 7 (cationic amino acid            Hs.22891   SLC7A8    -1.8
  transporter, y+ system), member 8

4. Notch pathway
Notch homolog 2 (Drosophila)                            Hs.8121    NOTCH2    -1.4         0.15
hairy homolog (Drosophila), HES1                        Hs.250666  HRY       -2.1         0.2
jagged 2                                                Hs.166154  JAG2       1.6         0.61

5. Wnt pathway
wingless-type MMTV integration site family, member 5A   Hs.152213  WNT5A     -5.8         0.12
dickkopf homolog 3                                      Hs.4909    DKK3      -5.6         1.21
wingless-type MMTV integration site family, member 6    Hs.29764   WNT6      -4.2         0.23
frizzled homolog 7 (Drosophila)                         Hs.173859  FZD7      -4.1         0.65
frizzled homolog 2 (Drosophila)                         Hs.81217   FZD2      -3.7         0.56
frizzled homolog 10 (Drosophila)                        Hs.31664   FZD10      2.97        0.86
dickkopf homolog 4                                      Hs.159311  DKK4       7.37        0.71
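
For orientation, the log2 ratios reported in Table I can be converted to linear fold changes, a fourfold difference corresponding to an absolute log2 ratio of 2. The following Python sketch is illustrative only; it assumes that the ratio is expressed as round (SW480R) over adherent (SW480A) signal, which is how the accompanying text reads, and it uses a handful of values from Table I as sample input.

```python
import math


def fold_change(log2_ratio):
    """Linear fold change corresponding to a log2 ratio."""
    return 2.0 ** log2_ratio


def differs_by_at_least(log2_ratio, fold):
    """True if the magnitude of change is at least `fold`, in either direction."""
    return abs(log2_ratio) >= math.log2(fold)


# A few average log2 ratios from Table I (assumed to be SW480R relative to SW480A).
EXAMPLES = {"FN1": -5.2, "VIM": -4.1, "FGFR4": 3.9, "MSN": -1.1}
for symbol, ratio in EXAMPLES.items():
    print(f"{symbol}: {fold_change(ratio):.3f}-fold, "
          f"fourfold or more: {differs_by_at_least(ratio, 4)}")
```

A negative log2 ratio yields a fold change below 1, i.e. lower expression in the numerator cell type under the stated assumption.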



EXAMPLE 4 

PROX-1 SILENCING IN SW480R CELLS LEADS TO A DIFFERENTIATED 
AND QUIESCENT PHENOTYPE. 

Experiments were conducted to investigate whether Prox-1 plays a role 
in the generation and maintenance of the highly transformed phenotype. Prox-1 
mRNA and protein in the SW480R cells were suppressed using Prox-1-targeting 
siRNA. Absence of Prox-1 in Prox-1 siRNA-transfected cells, but not in the control GFP 
siRNA-transfected cells, was confirmed by immunofluorescent staining, and nuclei were 
visualized with Hoechst 33342. Prox-1 siRNA-transfected cells, but not the 
untransfected or GFP siRNA-transfected cells, underwent a morphological change, 
which became visible by 72 hours and persisted for at least 10 days after the transient 
transfection. The Prox-1 siRNA-transfected cells first became more elongated and 
displayed extensive membrane ruffling. Eventually the Prox-1 siRNA cells started to 
spread on the plate, and an increased number of actin stress fibers could be visualized 
by phalloidin staining. BrdU incorporation experiments demonstrated that the 
Prox-1 siRNA-transfected cells proliferated at a lower rate than GFP siRNA-transfected or 
nontransfected cells (22±0.5% BrdU-positive cells with Prox-1 siRNA A16 and 18±1% with 
Prox-1 siRNA A25 vs. 34±4% with GFP siRNA). 
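
To put the BrdU figures above on a common scale, the relative reduction in the BrdU-positive fraction can be computed from the reported mean percentages. The following Python sketch is illustrative arithmetic only: it uses the mean values stated above, ignores the reported error ranges, and the function name is introduced here for the example.

```python
def relative_reduction(control_percent, treated_percent):
    """Fractional decrease of the treated value relative to the control value."""
    return (control_percent - treated_percent) / control_percent


# Mean percentages of BrdU-positive cells as reported in the text.
BRDU_POSITIVE = {
    "GFP siRNA": 34.0,
    "Prox-1 siRNA A16": 22.0,
    "Prox-1 siRNA A25": 18.0,
}

control = BRDU_POSITIVE["GFP siRNA"]
for name, value in BRDU_POSITIVE.items():
    if name != "GFP siRNA":
        print(f"{name}: {relative_reduction(control, value):.0%} fewer BrdU-positive cells")
```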

Changes in the gene expression profiles of the SW480R and SW480A cells 120 and
240 h posttransfection, when the morphological changes were apparent, were also
analyzed. Only 29 down-regulated and 120 up-regulated genes were identified in
Prox-1 siRNA versus GFP siRNA transfected cells (Table II). 41% of these genes
were differentially expressed between the SW480R and SW480A cells, suggesting
that Prox-1 at least partially determines the phenotype of SW480R cells. The
ablation of Prox-1 led to up-regulation of a number of known epithelial markers,
such as annexin A1, CRABP2, S100A3, and EMP1, along with an increase in the cell
adhesion molecules OB-cadherin and integrins beta7, beta5 and alpha1. In line with
the observed growth arrest, a decrease in c-myc and a strong increase in the CDK
inhibitor p21Cip1 were also observed. Highly similar changes in the gene expression
profile were observed when another, unrelated Prox-1 siRNA was used, suggesting
that the cellular effects are due to the specific targeting of Prox-1 and did not
result from off-target silencing. In addition, titration experiments demonstrated
that the induction of p21 and other target genes occurred even at a low (20 nM)
concentration of Prox-1 siRNAs but not of the control GFP siRNA. Also, the
gene changes mentioned above were not observed in Prox-1 negative SW480A cells
transfected with siRNAs at a 100 nM concentration. The transfection efficiency was
controlled using another siRNA, which successfully suppressed the expression of its
target gene in SW480A cells.
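
The log2 ratios listed in Table II can be converted to linear fold changes as in the sketch below. The function names and the cutoff of |log2 ratio| >= 1 (a two-fold change) are illustrative assumptions rather than parameters stated in this example; the three values are copied from Table II.

```python
# Illustrative sketch: convert log2 expression ratios (Prox-1 siRNA vs GFP siRNA)
# to linear fold changes. Function names and the cutoff are assumptions.

def fold_change(log2_ratio: float) -> float:
    """Return the linear fold change for a log2 ratio (e.g. 1.0 -> 2.0)."""
    return 2.0 ** log2_ratio


def is_regulated(log2_ratio: float, cutoff: float = 1.0) -> bool:
    """True when the absolute log2 ratio meets the (assumed) two-fold cutoff."""
    return abs(log2_ratio) >= cutoff


# Log2 ratios copied from Table II.
table_ii = {"NEBL": -2.0, "MYC": -1.1, "IGFBP7": 5.8}

for symbol, ratio in table_ii.items():
    direction = "up" if ratio > 0 else "down"
    print(f"{symbol}: log2 = {ratio:+.1f}, "
          f"fold change = {fold_change(ratio):.2f} ({direction})")
```

A negative log2 ratio gives a fold change below 1, so a ratio of -2.0 corresponds to expression at one quarter of the control level.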

Table II. Genes regulated by Prox-1 in SW480R cells. An asterisk indicates genes
that were flagged as absent in either Prox-1 siRNA or GFP siRNA treated cells.
Genes differentially expressed between SW480R and SW480ADH cells are shown in
bold.

Genes down-regulated in the absence of Prox-1

Gene | UniGene cluster | Gene symbol | Log2 ratio, average | Stdev
Nebulette | Hs.5025 | NEBL | -2.0 | 0.4
transforming growth factor, beta-induced, 68kDa | Hs.118787 | TGFBI | -1.9 | 0.1
trinucleotide repeat containing 9 | Hs.110826 | TNRC9 | -1.9 | 0.2
insulin-like growth factor binding protein 3 | Hs.77326 | IGFBP3 | -1.6 | 0.0
calpain 1, (mu/I) large subunit | Hs.2575 | CAPN1 | -1.5 | 0.3
inhibitor of DNA binding 1 | Hs.75424 | ID1 | -1.5 | 0.3
midkine (neurite growth-promoting factor 2) | Hs.82045 | MDK | -1.5 | 0.1
FK506 binding protein 11, 19 kDa | Hs.24048 | FKBP11 | -1.4 | 0.1
caspase recruitment domain family, member 10 | Hs.57973 | CARD10 | -1.3 | 0.1
inhibin, beta B (activin AB beta polypeptide) | Hs.1735 | INHBB | -1.3 | 0.2
L1 cell adhesion molecule | Hs.1757 | L1CAM | -1.2 | 0.1
glutathione peroxidase 2 (gastrointestinal) | Hs.2704 | GPX2 | -1.2 | 0.0
eukaryotic translation elongation factor 1 alpha | Hs.2642 | EEF1A2 | -1.2 | 0.2
hypothetical protein FLJ11149 | Hs.37558 | FLJ11149 | -1.2 | 0.2
potassium voltage-gated channel, subfamily H (eag-related), member 2 | Hs.188021 | KCNH2 | -1.1 | 0.1
KIAA0182 protein | Hs.75909 | KIAA0182 | -1.1 | 0.0
lectin, galactoside-binding, soluble, 1 (galectin 1) | Hs.382367 | LGALS1 | -1.1 | 0.1
Homo sapiens cDNA FLJ41000 fis, |  |  | -1.1 | 0.3
ephrin-B2 | Hs.30942 | EFNB2 | -1.1 | 0.1
v-myc myelocytomatosis viral oncogene homolog (avian) | Hs.79070 | MYC | -1.1 | 0.1
S100 calcium binding protein A14 | Hs.288998 | S100A14 | -1.1 | 0.2
Alpha one globin [Homo sapiens], mRNA sequence* |  |  | -1.1 | 0.1
hypothetical protein FLJ10986* | Hs.273333 | FLJ10986 | -1.0 | 0.0
hypothetical protein FLJ11149 | Hs.37558 | FLJ11149 | -1.0 | 0.0
myelin transcription factor 1* |  | MYT1 | -1.0 | 0.0
nucleolar autoantigen (55kD) similar to rat synaptonemal complex protein* | Hs.446459 | SC65 | -1.0 | 0.1
tumor necrosis factor receptor superfamily, member 6b, decoy | Hs.455817 | TNFRSF6B | -1.0 | 0.1
jagged 2 | Hs.166154 | JAG2 | -1.0 | 0.1
mitochondrial ribosomal protein S2 | Hs.20776 | MRPS2 | -1.0 | 0.1
Total: 29 genes

Genes up-regulated in the absence of Prox-1

Gene | UniGene cluster | Gene symbol | Log2 ratio, average | Stdev
insulin-like growth factor binding protein 7* | Hs.119206 | IGFBP7 | 5.8 | 0.4
chitinase 3-like 1 (cartilage glycoprotein-39)* | Hs.75184 | CHI3L1 | 5.3 | 0.8
chemokine (C-X-C motif) receptor 4* | Hs.89414 | CXCR4 | 4.5 | 1.1
semaphorin 3C* | Hs.171921 | SEMA3C | 4.5 | 4.5
cadherin 11, type 2, OB-cadherin (osteoblast)* | Hs.75929 | CDH11 | 3.8 | 0.3
annexin A1 | Hs.78225 | ANXA1 | 3.7 | 1.1
hypothetical protein MGC10796* |  | MGC10796 | 3.3 | 0.4
CD44 antigen | Hs.169610 | CD44 | 2.6 | 1.1
Homo sapiens clone 23785 mRNA sequence |  |  | 2.9 | 0.4
epithelial membrane protein 1* | Hs.79368 | EMP1 | 2.9 | 0.1
inhibitor of DNA binding 2, dominant negative helix-loop-helix protein* | Hs.180919 | ID2 | 2.8 | 0.1
Human HepG2 3' region cDNA, clone hmdlf06, mRNA sequence |  |  | 2.8 | 0.3
tumor necrosis factor receptor superfamily, member 11b (osteoprotegerin)* | Hs.81791 | TNFRSF11B | 2.6 | 0.7
likely homolog of mouse glucuronyl C5-epimerase* | Hs.183006 | GLCE | 2.6 | 1.1
ribonuclease, RNase A family, 1 (pancreatic)* | Hs.78224 | RNASE1 | 2.6 | 0.1
apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like 3B* | Hs.226307 | APOBEC3B | 2.5 | 0.1
hydroxyprostaglandin dehydrogenase 15-(NAD)* | Hs.77348 | HPGD | 2.5 | 1.1
NPD009 protein | Hs.283675 | NPD009 | 2.5 | 0.6
integrin, beta 7* | Hs.1741 | ITGB7 | 2.4 | 0.1
fibroblast growth factor 20* | Hs.154302 | FGF20 | 2.3 | 1.0
KIAA0455 gene product | Hs.13245 | KIAA0455 | 2.3 | 1.3
cAMP-specific phosphodiesterase 8B1 [Homo sapiens], mRNA sequence* | Hs.78106 | PDE8B | 2.3 | 0.4
ectodermal-neural cortex (with BTB-like domain)* | Hs.104925 | ENC1 | 2.3 | 0.2
frizzled homolog 1 (Drosophila)* | Hs.94234 | FZD1 | 2.3 | 0.8
S100 calcium binding protein A3* | Hs.433168 | S100A3 | 2.2 | 0.6
zeta-chain (TCR) associated protein kinase 70kDa* | Hs.234569 |  |  | 
platelet derived growth factor C* | Hs.43080 | PDGFC | 2.1 | 0.1
cystatin D* | Hs.121489 | CST5 | 2.1 | 0.3
CCAAT/enhancer binding protein (C/EBP), delta | Hs.76722 | CEBPD | 2.1 | 0.1
sorbin and SH3 domain containing 1 | Hs.108924 | SORBS1 | 2.1 | 0.5
metallothionein 2A | Hs.118786 | MT2A | 2.0 | 0.6
RAS guanyl releasing protein 1 (calcium and DAG- | Hs.182591 | RASGRP1 | 2.0 | 0.4
checkpoint suppressor 1 | Hs.211773 | CHES1 | 2.0 | 0.4
chondroitin beta1,4 N-acetylgalactosaminyltransferase* | Hs.11260 | ChGn | 2.0 | 0.4
filamin B, beta (actin binding protein 278)* | Hs.81008 | FLNB | 2.0 | 0.4
aldehyde dehydrogenase 1 family, member A2* | Hs.95197 | ALDH1A2 | 2.0 | 0.6
jagged 1 (Alagille syndrome) | Hs.91143 | JAG1 | 2.0 | 0.1
A kinase (PRKA) anchor protein (gravin) 12* | Hs.788 | AKAP12 | 1.9 | 0.1
metallothionein 1X* | Hs.380778 | MT1X | 1.9 | 0.8
creatine kinase, mitochondrial 2 (sarcomeric) | Hs.80691 | CKMT2 | 1.8 | 0.6
serum-inducible kinase | Hs.3838 | SNK | 1.8 | 0.1
CGI-130 protein | Hs.32826 | CGI-130 | 1.8 | 0.1
guanine nucleotide binding protein (G protein), alpha inhibiting activity polypeptide 1 | Hs.203862 | GNAI1 | 1.8 | 0.4
related to the N terminus of tre* | Hs.278526 | RNTRE |  | 
solute carrier family 12 (sodium/potassium/chloride transporters), member 2 | Hs.110736 | SLC12A2 | 1.7 | 0.3
Human clone 23612 mRNA sequence |  |  | 1.7 | 1.0
ankyrin repeat and SOCS box-containing 4 | Hs.248062 | ASB4 | 1.7 | 0.8
apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like 3C | Hs.8583 | APOBEC3C | 1.7 | 0.1
cellular retinoic acid binding protein 2* | Hs.183650 | CRABP2 | 1.7 | 0.1
KIAA0657 protein* | Hs.6654 | KIAA0657 | 1.7 | 1.1
phosphodiesterase 4D, cAMP-specific (phosphodiesterase E3 dunce homolog, Drosophila) | Hs.172081 | PDE4D |  | 0.1
autism susceptibility candidate 2 | Hs.32168 | AUTS2 | 1.6 | 0.4
hairy/enhancer-of-split related with YRPW motif 2* | Hs.144287 | HEY2 | 1.6 | 0.0
immediate early response 5 | Hs.15725 | IER5 | 1.6 | 0.1
E3 ubiquitin ligase SMURF2 | Hs.194477 | SMURF2 | 1.6 | 0.4
ADP-ribosylation factor-like 7* | Hs.111554 | ARL7 | 1.6 | 1.0
Ras and Rab interactor 2* | Hs.62349 | RIN2 | 1.6 | 0.4
GS3955 protein, Tribbles homolog 2 | Hs.155418 | TRB2 | 1.6 | 0.5
metallothionein 1L | Hs.448357 | MT1L | 1.5 | 0.6
glutamate receptor, metabotropic 8 | Hs.86204 | GRM8 | 1.5 | 0.2
klotho | Hs.94592 | KL | 1.5 | 0.1
calmodulin-like 3 | Hs.239600 | CALML3 | 1.4 | 0.6
integrin, alpha 1 | Hs.116774 | ITGA1 | 1.4 | 0.1
lymphoid enhancer-binding factor 1 | Hs.44865 | LEF1 | 1.4 | 0.4
epithelial V-like antigen 1 | Hs.116651 |  |  | 0.1
likely ortholog of mouse limb-bud and heart gene* | Hs.57209 | LBH | 1.4 | 0.1
insulin induced protein 2 | Hs.7089 | ISG2 | 1.4 | 0.2
patched homolog (Drosophila) | Hs.159526 | PTCH | 1.4 | 0.1
chemokine-like factor super family 6 | Hs.380627 | CKLFSF6 | 1.3 | 0.3
lipoma HMGIC fusion partner | Hs.93765 | LHFP | 1.3 | 0.4
transforming growth factor, alpha | Hs.170009 | TGFA | 1.3 | 0.4
Homo sapiens mRNA; cDNA DKFZp762M127 (from clone DKFZp762M127), mRNA sequence | - |  | 1.3 | 0.6
cyclin I | Hs.79933 | CCNI | 1.3 | 0.1
hyaluronan synthase 2 | Hs.159226 | HAS2 | 1.3 | 0.5
IQ motif containing GTPase activating protein 1 | Hs.1742 | IQGAP1 | 1.3 | 0.5
zinc finger protein 216 | Hs.406096 | ZNF216 | 1.3 | 
cDNA DKFZp564O0122 |  |  | 1.3 | 0.2
aryl hydrocarbon receptor | Hs.170087 | AHR | 1.2 | 0.6
neuroepithelial cell transforming gene 1 | Hs.25155 | NET1 | 1.2 | 0.1
sterol-C4-methyl oxidase-like | Hs.239926 | SC4MOL | 1.2 | 0.1
tubulin, alpha 3 | Hs.433394 | TUBA3 | 1.2 | 0.1
BCG-induced gene in monocytes, clone 103 | Hs.284205 | BIGM103 | 1.2 | 0.0
cathepsin B | Hs.297939 | CTSB | 1.2 | 0.0
keratin 6A | Hs.367762 | KRT6A | 1.2 | 0.4
paraoxonase 2 | Hs.169857 | PON2 | 1.2 | 0.4
suppressor of cytokine signaling 5 | Hs.169836 | SOCS5 | 1.2 | 0.4
KIAA0877 protein | Hs.11217 | KIAA0877 |  | 0.2
 | Hs.80741 | PCCA | 1.2 | 0.2
solute carrier family 2 | Hs.7594 | SLC2A3 | 1.2 | 0.1
 | Hs.22891 | SLC7A8 | 1.2 | 0.1
Homo sapiens mRNA; cDNA DKFZp762M127 |  |  | 1.2 | 0.1
aryl hydrocarbon receptor nuclear translocator-like | Hs.74515 | ARNTL | 1.1 | 0.3
DnaJ (Hsp40) homolog, subfamily B, member 6 | Hs.181195 | DNAJB6 | 1.1 | 0.3
hypothetical protein FLJ21276 |  | FLJ21276 | 1.1 | 0.1
integrin, beta 5 | Hs.149846 | ITGB5 | 1.1 | 0.1
PTK7 protein tyrosine kinase 7 | Hs.90572 | PTK7 | 1.1 | 0.3
transforming growth factor, beta receptor II | Hs.82028 | TGFBR2 | 1.1 | 0.1
Homo sapiens cDNA FLJ25134 fis | Hs.301306 |  | 1.1 | 0.0
DKFZP564A2416 protein | Hs.5297 | DKFZP564A2416 | 1.1 | 0.1
dual specificity phosphatase 6 | Hs.180383 | DUSP6 | 1.1 | 0.4
midline 1 (Opitz/BBB syndrome) | Hs.27695 | MID1 | 1.1 | 0.1
membrane protein, palmitoylated 1, 55kDa | Hs.1861 | MPP1 | 1.1 | 0.1
LIM domain protein | Hs.424312 | RBL | 1.1 | 0.1
SH3-domain binding protein 5 (BTK-associated) | Hs.109150 | SH3BP5 | 1.1 | 0.1
SIPL protein | Hs.64322 | SIPL | 1.1 | 0.1
tumor protein D52-like 1 | Hs.16611 | TPD52L1 | 1.1 | 0.4
3-hydroxy-3-methylglutaryl-Coenzyme A reductase | Hs.11899 | HMGCR | 1.0 | 0.1
homeo box B7 | Hs.819 | HOXB7 | 1.0 | 0.1
HIV-1 Tat interactive protein 2, 30kDa | Hs.90753 | HTATIP2 | 1.0 | 0.1
insulin receptor substrate 2 | Hs.143648 | IRS2 | 1.0 | 0.1
tubulin beta-5 | Hs.274398 | TUBB-5 | 1.0 | 0.0
apoptosis antagonizing transcription factor | Hs.16178 | AATF | 1.0 | 0.1
E2F transcription factor 3 | Hs.1189 | E2F3 | 1.0 | 0.1
hypothetical protein FLJ12542 | Hs.236940 | FLJ12542 | 1.0 | 0.1
phafin 2, Pleckstrin homology domain containing, family F member 2 | Hs.29724 | PLEKHF2 | 1.0 | 0.1
proline 4-hydroxylase | Hs.3622 | P4HA2 | 1.0 | 0.1
Homo sapiens G21VN02 mRNA, mRNA sequence, solute carrier family 5 (inositol transporters), member 3 | Hs.324787 | SLC5A3 | 1.0 | 0.1

EXAMPLE 5 

ABLATION OF PROX-1 LEADS TO DIFFERENTIATION THROUGH
UP-REGULATION OF NOTCH SIGNALING IN THE SW480R CELLS.

Activation of the β-catenin/TCF pathway plays a central role in colon
tumorigenesis (Giles, R. H., van Es, J. H. & Clevers, H., Biochim Biophys Acta 1653:
1-24, 2003). Of interest for this study, suppression of β-catenin/TCF signaling in
colon cancer cells decreases the levels of c-myc, increases p21Cip1 levels and induces
cell cycle arrest (van de Wetering et al., Cell 111: 241-50, 2002). However,
suppression of Prox-1 did not affect the activity of a β-catenin/TCF-responsive
reporter or the nuclear localization of β-catenin. Moreover, an increased expression
of several β-catenin/TCF-4 target genes, such as CD44, ENC1 and Id2, was observed
in the absence of Prox-1 (Table II and not shown). These data suggest that Prox-1
may act via an alternative pathway to promote growth of colon cancer cells, and that
both β-catenin/TCF activation and overexpression of Prox-1 are necessary for cell
transformation. Accordingly, contemplated herein are methods of alleviating
colorectal cancer whereby a Prox-1 suppressor is administered in combination with a
β-catenin/TCF signaling inhibitor. β-catenin/TCF signaling inhibitors may include
dominant negative forms of TCF-4, siRNAs and microRNAs targeting TCF-4,
β-catenin, and c-myc, as well as small molecules that would interfere with binding of
β-catenin to TCF-4 or of TCF-4 to target DNA sequences. Protocols for making these
types of inhibitors are detailed above with respect to Prox-1 inhibition.

The DNA and protein sequences for β-catenin (SEQ ID NOs: 10 and
11, respectively) are published and disclosed as Genbank Accession Number
NM_001904. The DNA and protein sequences for TCF-4 (SEQ ID NOs: 12 and 13,
respectively) are published and disclosed as Genbank Accession Number
NM_003199. Related to the β-catenin/TCF signaling pathway is the APC gene, the
sequence of which is publicly available as Genbank Accession Number NM_000038.
The DNA and amino acid sequences for APC are also provided herein as SEQ ID
NOs: 42 and 43, respectively. The DNA and protein sequences for c-myc (SEQ ID
NOs: 44 and 45, respectively) are published and disclosed as Genbank Accession
Number NM_002467.

Notch signaling has been shown to be essential for the generation of
cell lineages in the crypts of the mouse small intestine. High levels of Notch are
thought to suppress the expression of the basic helix-loop-helix transcription factor
Math1 via the induction of the transcriptional repressor Hes1, which leads to the
differentiation of progenitor cells into enterocytes. Conversely, high levels of Math1
result in differentiation towards the neuroendocrine, goblet and Paneth cell types
in the small intestine (Jensen, J. et al., Nat Genet 24: 36-44, 2000; Yang, Q.,
Bermingham, N. A., Finegold, M. J. & Zoghbi, H. Y., Science 294: 2155-8, 2001).
Among Notch signaling components, the levels of Notch2 and its target transcription
factor Hes1 are higher in SW480A cells than in SW480R cells, suggesting
that this pathway is functionally active in these cells. Interestingly, SW480R cells
express higher levels of the Notch ligand Jagged2. Suppression of Prox-1 resulted in
up-regulation of the Notch ligand Jagged1 and of a direct target of the Notch pathway,
the transcription factor Hey2, whereas the expression of Jagged2 and of prostaglandin
D2 synthase, previously shown to be negatively regulated by Notch signaling, was
suppressed (Fujimori, K. et al., J Biol Chem 278: 6018-26, 2003). SW480R cells
were transfected with GFP siRNA or Prox-1 siRNA and with CBF1-luc, TOPFlash or
control FOPFlash reporters. Firefly luciferase activity was normalized to Renilla
luciferase activity. Up-regulation of the Notch-responsive reporter CBF1-luc was
observed in SW480R cells transfected with Prox-1 siRNAs. Accordingly,
contemplated herein is a method of alleviating the symptoms of colorectal cancer
comprising the administration of a Prox-1 suppressor in combination with a Notch
agonist or target transcription factor. Notch agonists include Jagged1, Jagged2,
Delta1, Delta3, Delta4, and Serrate. Target Notch transcription factors include Hey1,
Hey2, and Hes1.
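
The reporter normalization described above (firefly luciferase activity divided by Renilla luciferase activity) can be sketched as follows. This is a minimal illustration only: the function names are hypothetical, and the luminometer readings are arbitrary placeholders, not measurements from this example.

```python
# Illustrative sketch of dual-luciferase normalization. All names and readings
# below are hypothetical placeholders, not measurements from this example.

def normalized_activity(firefly: float, renilla: float) -> float:
    """Firefly reporter signal divided by the Renilla transfection control."""
    if renilla <= 0:
        raise ValueError("Renilla signal must be positive")
    return firefly / renilla


def fold_induction(treated: float, control: float) -> float:
    """Normalized activity of the treated sample relative to the control."""
    return treated / control


# Placeholder luminometer readings (arbitrary units).
gfp_sirna = normalized_activity(firefly=1000.0, renilla=500.0)
prox1_sirna = normalized_activity(firefly=3000.0, renilla=500.0)

print(f"fold induction: {fold_induction(prox1_sirna, gfp_sirna):.1f}")
```

Dividing by the Renilla signal corrects for well-to-well differences in transfection efficiency before the Prox-1 siRNA and GFP siRNA samples are compared.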

The DNA and protein sequences for Notch-1 (SEQ ID NOs: 14 and 15,
respectively) are published and disclosed as Genbank Accession Number
NM_017617. Likewise, the DNA and protein sequences for various forms of Notch
(including 2-4) are publicly available and included herein as SEQ ID NOs: 16-21. In
addition, the DNA and protein sequences for various ligands of Notch (including
Jagged1, Jagged2, Jagged2 (transcript variant 2), Delta1, Delta3, Delta4, and Jagged2
(transcript variant 1)) are publicly available and included herein as SEQ ID NOs: 22-
35, respectively. DNA and protein sequences for target Notch transcription factors
Hey1, Hey2, and Hes1 are also publicly available and are included herein as SEQ ID
NOs: 36-41, respectively.

EXAMPLE 6 

SUPPRESSION OF PROX-1 INHIBITS GROWTH IN SOFT AGAR.

Since anchorage-independent growth is one of the hallmarks of
malignant transformation, experiments were conducted to assess the effects of Prox-1
suppression on the growth of SW480R cells in soft agar. SW480R cells were
transfected with GFP siRNA, Prox-1 siRNA A16 or Prox-1 siRNA A25 repeatedly
over an 8-day period, or left untreated, and seeded in soft agar in triplicate. The
number of colonies was scored after two weeks of growth. Transfection with both
Prox-1 siRNAs, but not the control GFP siRNA, significantly decreased the number of
colonies formed after two weeks of growth in soft agar (Fig. 3A).
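
One way to summarize triplicate colony counts of this kind is sketched below: the mean and sample standard deviation per group, and each group mean as a percentage of the GFP siRNA control. The helper names are hypothetical and the counts are made-up placeholders, not the data shown in Fig. 3A.

```python
# Illustrative sketch: summarize triplicate soft agar colony counts.
# The counts are made-up placeholders, not data from this example.
from statistics import mean, stdev


def summarize(counts):
    """Return (mean, sample standard deviation) of replicate colony counts."""
    return mean(counts), stdev(counts)


def percent_of_control(treated_mean, control_mean):
    """Colony number of a treated group as a percentage of the control group."""
    return 100.0 * treated_mean / control_mean


colonies = {
    "GFP siRNA": [100, 110, 90],         # placeholder triplicate
    "Prox-1 siRNA A16": [40, 50, 45],    # placeholder triplicate
}

control_mean, _ = summarize(colonies["GFP siRNA"])
for group, counts in colonies.items():
    m, sd = summarize(counts)
    pct = percent_of_control(m, control_mean)
    print(f"{group}: {m:.1f} +/- {sd:.1f} colonies ({pct:.0f}% of control)")
```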

EXAMPLE 7 

REGULATION OF PROSTAGLANDIN BIOSYNTHESIS BY PROX-1 

COX-2 is a key enzyme involved in the conversion of arachidonic acid
into the prostaglandin precursors PGG2 and PGH2, which are further transformed into
biologically active prostaglandins by the action of corresponding synthases.
Prostaglandins act through binding to the G-protein coupled prostanoid receptors, and
they are rapidly inactivated by the action of 15-prostaglandin dehydrogenase (15-PGDH).
COX-2 is overexpressed in the majority of colorectal cancers and in about
half of colonic adenomas, suggesting that increased PG production is important
for tumor growth. In support of this view, treatment with non-steroid
anti-inflammatory drugs, which act as inhibitors of COX-2, significantly reduces the
risk of developing colon cancer (Gupta, R. A. & Dubois, R. N., Nat Rev Cancer 1: 11-21,
2001). Accordingly, contemplated herein is a method of alleviating colorectal cancer
via the administration of a Prox-1 suppressor in combination with a COX-2 inhibitor.
COX-2 inhibitors may include the following non-steroid anti-inflammatory drugs:
aspirin, rofecoxib, celecoxib, amidophen, analgin, anapyrin, feloran, indomethacin,
paracetamol, piroxicam, sedalgin, diclofenac sodium, ketoprofen, Acular®, Ocufen®,
and Voltarol®.

Experiments were conducted which found that suppression of Prox-1
in SW480R cells resulted in the up-regulation of the expression of 15-PGDH and
down-regulation of prostaglandin D2 synthase, whereas overexpression of Prox-1 in
SW480F cells down-regulated 15-PGDH and up-regulated PGD2 synthase
(Affymetrix results). These data suggest that Prox-1 may be important for the control
of the balance of total PG production in tumor cells, i.e., in the presence of Prox-1,
decreased expression of 15-PGDH will result in higher net amounts of biologically
active prostaglandins and enhanced tumor growth.
Because SW480 cells do not express COX-2, contemplated herein are
experiments to assess the effects of Prox-1 on prostanoid biosynthesis in SW480F
cells stably transfected with COX-2 or in a cell line known to express this
enzyme, such as HCA-7. To generate COX-2 expressing cells, SW480F cells are
transfected with a mixture of a COX-2 expressing vector and a plasmid bearing a
hygromycin resistance gene, such as pCDNA3.1hygro (Invitrogen), using
Lipofectamine 2000, as described in Materials and Methods, and stable clones are
selected using 200 µg/ml hygromycin B (Calbiochem) over a period of 2-3 weeks.
Individual clones are isolated and the expression of COX-2 protein is tested using
Western blotting. Functionality of COX-2 may be further verified in COX-2
expressing clones in comparison to the control cells, using ELISA to monitor PGE2
production according to the manufacturer's instructions (Cayman Chemical). To test
the effects of Prox-1 on prostaglandin biosynthesis, COX-2 expressing cells can be
infected with AdProx-1 or the control AdGFP virus, as described previously (Petrova
et al., Embo J. 21: 4593-9, 2002), and the amount of biologically active PGE2 and the
total amount of metabolized PGE2 in cell conditioned medium determined by ELISA
(Cayman Chemical). If overexpression of Prox-1 increases levels of the biologically
active PGE2 in vitro, contemplated herein are studies to assess the link between
Prox-1 overexpression and prostanoid biosynthesis in vivo. SW480R or HCA-7 cells
stably overexpressing 15-PGDH will be produced using the protocol described above,
and the tumorigenic potential of these cells in nude mice will be determined. In
addition, contemplated are studies regarding the effects of treatment with a 15-PGDH
inhibitor on the growth of Prox-1 expressing or control xenografts in nude mice.

EXAMPLE 8 

EFFECTS OF NOTCH SIGNAL TRANSDUCTION

To investigate the effects of altered Notch signaling in SW480R cells,
described herein are experiments in which constitutively active Notch1,
Notch2, Notch3, and Notch4 intracellular domains, as well as Jagged1, soluble
Jagged1, and Jagged2, are overexpressed using recombinant adenoviruses.
Replication-deficient adenoviruses for the expression of constitutively active
Notch 1-4 intracellular domains and the Notch ligands Jagged1, Jagged2, Delta1,
Delta3, Delta4, and Serrate are produced. SW480R cells are infected with the
adenoviruses. At 48-72 h postinfection, the cells are seeded in soft agar as described
previously, and the number of colonies is scored after two weeks in culture. In
parallel, total RNA is isolated and analysis of gene expression changes is conducted
using Affymetrix® microarrays according to the previously described procedures. If
overexpression of Notch or its ligand results in the inhibition of cell growth in soft
agar, further studies are conducted to investigate the effects of activation of Notch
signaling on the growth of tumors in nude mice.

EXAMPLE 9 

EFFECTS OF PROX-1 SUPPRESSION ON SW480R IN NUDE MICE 

Also contemplated herein are studies to assess the effects of Prox-1
suppression on the growth of SW480R tumors in nude mice. Nu/nu mice can be
inoculated subcutaneously or intraperitoneally with 1-5 x 10^6 cells/mouse using
SW480R cells transfected with GFP siRNA or Prox-1 siRNA, or transduced with the
adenoviruses described in Example 8. Tumors are allowed to grow for 3-5 weeks,
and tumor size is measured twice a week. Animals are sacrificed by cervical
dislocation, and tumors are excised and processed for immunohistochemical staining.
The tumor histology, expression of differentiation markers, proliferation index and
vascularization are monitored using antibodies against Ki67 (proliferation), mucin,
galectin-2, p21Cip1 (differentiation), PECAM-1 and vWF (blood vessel markers), and
the standard immunostaining protocols.
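
Twice-weekly tumor size measurements are often converted to approximate volumes before growth curves are compared. The sketch below assumes caliper measurements and the modified ellipsoid formula V = (length x width^2) / 2, a common convention that is not specified in this example; the function name and the readings are hypothetical placeholders.

```python
# Illustrative sketch: estimate tumor volume from twice-weekly caliper readings.
# The modified ellipsoid formula V = (length * width^2) / 2 is an assumed
# convention, not a method stated in this example. Readings are placeholders.

def tumor_volume_mm3(length_mm: float, width_mm: float) -> float:
    """Approximate volume, treating length as the longer and width the shorter diameter."""
    if width_mm > length_mm:
        length_mm, width_mm = width_mm, length_mm
    return length_mm * width_mm ** 2 / 2.0


# (day, length in mm, width in mm): hypothetical measurements for one animal.
readings = [(3, 4.0, 3.0), (7, 6.0, 4.0), (10, 8.0, 6.0)]

for day, length, width in readings:
    print(f"day {day}: {tumor_volume_mm3(length, width):.1f} mm^3")
```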

To assess the effects of Prox-1-dependent genes, such as 15-PGDH, on
prostaglandin metabolism and tumor growth in vivo, SW480R or HCA-7 cells
recombinantly overexpressing 15-PGDH and control cells are implanted
subcutaneously into nu/nu mice, and tumor growth and differentiation are studied. In
order to confirm the specificity of the 15-PGDH effects, a subset of the control and
15-PGDH overexpressing tumor-bearing animals is treated with the 15-PGDH inhibitor
CAY10397, administered intravenously or in drinking water.

EXAMPLE 10 

ANALYSIS OF PROX-1 IN NATURAL COLORECTAL TUMORS

Experiments to assess the expression of Prox-1 in a mouse model of
human familial adenomatous polyposis, Apc min/+, are also contemplated herein.
Apc min/+ mice bear a truncating mutation in one allele of the Apc gene and develop
multiple intestinal polyps, which further progress to adenocarcinoma. The mice are
commercially available from JAX. As another cancer model, SMAD3 deficient mice,
which develop invasive colorectal cancer, are available. The DNA and protein
sequences for APC (SEQ ID NOs: 42 and 43, respectively) are published and
disclosed as Genbank Accession Number NM_000038.

Administration of a Prox-1 inhibitor and a placebo to mice of the
above-described models is also contemplated. Prox-1 inhibitors and administration
thereof are described herein. Prox-1 inhibitors available for administration include,
but are not limited to, antisense oligonucleotides, siRNA constructs, or dominant
negative proteins. Monitoring of the mice post-administration is contemplated to
evaluate the effects on adenocarcinoma and colorectal cancer development and
growth. Among the results are measurements of the speed of tumor growth in mice
that received the Prox-1 inhibitor versus mice that received the placebo, thus
providing an efficacy model for the particular Prox-1 inhibitor. Also
contemplated are methods for screening Prox-1 levels in family members with
familial adenomatous polyposis. Methods for screening Prox-1 levels are described
herein. Administration of a prophylactic to protect from progression, or the onset of
cancer, is contemplated where elevated levels of Prox-1 are observed.

EXAMPLE 11 

DETECTION OF PROX-1 PROTEIN EXPRESSION IN COLORECTAL
CANCER

As described above, measuring Prox-1 protein expression in colon
tissues may be a useful tool for diagnosing colon cancer and/or premalignancies.
Prox-1 mRNA can be detected in colorectal cancer tissues as described in Example 2.
The following prospective example may be conducted to determine whether Prox-1
protein correlates with Prox-1 transcript expression in colorectal cancer tissue. The
immunohistochemical analysis can be carried out as follows using an anti-human
Prox-1 antibody directed against the human Prox-1 peptide, as described in Example 1.

The tissues for screening are snap frozen in liquid nitrogen after
dissection, embedded in OCT compound, and sectioned. Sections are fixed in -20°C
methanol for 10 min and processed for staining.

To enhance epitope recovery, the tissues may undergo steam induced
epitope recovery with a retrieval solution, including several different SHIER solutions
with and without enzyme digestion at two different concentrations. The tissues can
then be heated in the capillary gap in the upper chamber of a Black and Decker
Steamer as described in Ladner et al. (Cancer Research, 60: 3493-3503, 2000).

Automated immunohistochemistry is carried out with the TECHMATE
1000 or TECHMATE 500 (BioTek Solutions, Ventana Medical Systems).
Specifically, the tissues are blocked with 3% and 10% normal goat serum for 15 and
30 minutes, respectively. Subsequently, the tissues are incubated with the primary
antibody (anti-Prox-1 antibody) for 60 minutes at 3.0 µg/ml. The tissues are stained
with the biotinylated goat-anti-rabbit IgG secondary antibody for 25 minutes.
Optimal results are obtained with overnight incubation. To ensure that the staining
procedure is working appropriately, anti-vimentin is used as a positive control and
rabbit IgG is used as a negative control.

The antibody binding is detected by an avidin-biotin based tissue
staining system in which horseradish peroxidase is used as a reporter enzyme and DAB
(3,3'-diaminobenzidine tetrahydrochloride) is used as a chromogen. Specifically,
the endogenous peroxidases are blocked for 30 minutes, the avidin-biotin complex
reagent is added, and then the tissues are incubated in DAB for a total of 15 minutes.
Finally, the tissues are counterstained with hematoxylin to assess cell and tissue
morphology.

The slides are mounted in Aquamount, and the tissues are examined
visually under a light microscope. Tissue that is positive for increased Prox-1 protein
expression as compared to healthy colon tissue, or other cancer tissues, indicates
colorectal cancer and/or premalignant lesions.

While this prospective example provides one means of detecting colon
cancer, other means will be obvious to those with skill in the art. Various options for
detecting Prox-1 expression, and thereby screening for colon cancer, include, among
others, ELISA-based techniques and Western blotting techniques.

EXAMPLE 12

EXPRESSION PATTERN OF PROX-1 IN NORMAL COLONIC
EPITHELIUM

Studies were conducted to examine Prox-1 expression in normal
colonic epithelium. In normal colonic mucosa, all Prox-1 expressing cells were
positive for the intestinal epithelial transcription factor CDX2. There was no overlap
with the expression of MUC2, expressed by the goblet cells; however, a subset of
Prox-1 positive cells also expressed the pan-neuroendocrine marker chromogranin A.
Also observed was weaker but significant Prox-1 expression in the bottom of the
crypts below the cell proliferation zone identified by staining for the Ki67 antigen.

Colonic epithelium is composed of the slowly dividing stem cells
located in the bottom of the crypt, the cell proliferation zone with transient amplifying
cells, which give rise to the three main colonic epithelial cell types, and terminally
differentiated cells, located in the upper part of the crypts. The location of Prox-1
positive cells at the base of the crypts, therefore, corresponded to the position of the
intestinal stem cells (Bach, S. P., Renehan, A. G. & Potten, C. S., Carcinogenesis 21,
469-76 (2000); Potten, C. S., Kellett, M., Roberts, S. A., Rew, D. A. & Wilson, G. D.,
Gut 33, 71-8 (1992)). A similar staining pattern was observed in the murine
descending colon, whereas the duodenal epithelium was negative for Prox-1.
Expression of p21Cip1/WAF1 marks the differentiated compartment of colonic crypts
independently of the cell type (Doglioni, C. et al., J Pathol 179, 248-53 (1996)).
Accordingly, studies were conducted regarding the expression of Prox-1 in relation to
p21Cip1/WAF1. All Prox-1 positive cells located at the bottom of the crypts were
negative for p21Cip1/WAF1; moreover, most of the rare Prox-1 positive cells present in
the upper parts of the crypts were also negative for p21Cip1/WAF1, demonstrating a
mutually exclusive relation between Prox-1 expression and terminal differentiation.
p21(Cip1)/(WAF1) (CDKN1A) sequences are published and disclosed as Genbank
Accession Numbers NM_078467 and NM_000389. These variants (1) and (2) encode
the same protein.

Based on the data implicating Prospero/Prox-1 in cell fate
determination in other cell types, and on its expression pattern in colonic epithelial
cells, it is contemplated that Prox-1 may be involved in the regulation of the
neuroendocrine cell fate as well as the stem cell phenotype. This hypothesis is
supported by the fact that Prox-1 is overexpressed in intestinal neoplasms from
Apc min/+ mice and that its expression is regulated by the TCF/β-catenin pathway
in vitro (see Examples 13 and 14). This hypothesis is also in agreement with previous
results showing that targeted inactivation of the Tcf7l2 gene encoding TCF-4 leads to
the depletion of the intestinal stem cell compartment and loss of the neuroendocrine
lineage (Korinek, V. et al., Nat Genet 19, 379-83 (1998)).

EXAMPLE 13 

PROX-1 IS OVEREXPRESSED IN INTESTINAL NEOPLASMS FROM
APC min/+ MICE, BUT NOT FROM Ltbp4-DEFICIENT MICE

Studies were also conducted to assess Prox-1 expression in Apc min/+
mice. A truncating germline mutation in the Apc gene, together with somatic
inactivation of the remaining wild type allele, leads to abnormal β-catenin/TCF
signaling in intestinal epithelial cells of Apc min/+ mice and to the development of
multiple intestinal polyps (Luongo, C., Moser, A. R., Gledhill, S. & Dove, W. F., Cancer Res
54, 5947-52 (1994); Su, L. K. et al., Science 256, 668-70 (1992)). High levels of
Prox-1 were observed in intestinal neoplasms of Apc min/+ mice. Prox-1 mRNA and
protein were present in tumor cells with high cytoplasmic and nuclear β-catenin
levels, but not in the differentiating cells of the neighboring normal glands with
membrane localization of β-catenin.

Mutations in genes regulating the TGFβ signaling pathway, such as TGFRII
and SMAD4, occur in human colorectal cancer, and targeted inactivation of the TGF-β1
binding protein LTBP-4 leads to colon cancer in mice (White, R. L., Cell 92, 591-2
(1998); Sterner-Kock, A. et al., Genes Dev. 16, 2264-73 (2002)). Studies were
conducted to assess Prox-1 expression in Ltbp4-/- mice. In contrast to the results
from Apc min/+ mice, accumulation of Prox-1 was not observed in the colonic
adenocarcinomas from Ltbp4-/- mice, which generally preserve a normal distribution
of β-catenin. These results strongly suggest that Prox-1 is a target of the
APC/β-catenin/TCF pathway in vivo. Tumors from Ltbp4-/- mice had a strongly
increased number of lymphatic vessels, positive for both Prox-1 and LYVE-1.

EXAMPLE 14

PROX-1 EXPRESSION IS REGULATED BY β-CATENIN/TCF PATHWAY
AND DNA METHYLATION

Further studies were conducted using the SW480R cell line as an in vitro
model to investigate the role of Prox-1 in colorectal carcinoma. Suppression of Prox-1
expression using two different siRNAs (SEQ ID NOS: 4, 5, 6, and 7) did not affect
the activity of a β-catenin/TCF-responsive reporter, the nuclear localization of
β-catenin, or the cellular content of active, non-phosphorylated β-catenin, indicating
that Prox-1 does not act upstream of this pathway. In contrast, suppression of
β-catenin using two independent siRNAs resulted in almost complete disappearance of
Prox-1 mRNA and protein. In line with this finding, suppression of Prox-1 was also
observed in SW480R cells transfected with a dominant negative mutant of TCF4, which
disrupts β-catenin/TCF mediated transcription (Morin, P. J. et al., Science 275,
1787-90 (1997)). However, overexpression of p21 CIP1/WAF1, shown to induce
cell differentiation in colorectal carcinoma cells (van de Wetering, M. et al., Cell 111,
241-50 (2002)), did not modify Prox-1 levels. Taken together, these data show that
Prox-1 lies downstream of β-catenin/TCF4 and upstream of p21 CIP1/WAF1.

Also observed was increased expression of several known
β-catenin/TCF-4 target genes, such as CD44, ENC1 and ID2, in the absence of Prox-1
(Table II; Fujita et al., 2001; Rockman et al., 2001; Wielenga et al., 1999), while
others, such as p21 CIP1/WAF1, annexin A1, and OB-cadherin, were induced upon
suppression of either β-catenin or Prox-1. These results underline the complexity of
the regulatory cascade initiated by β-catenin/TCF in CRC cells and suggest that
concerted regulation by Prox-1 and other β-catenin/TCF targets is necessary for
neoplastic growth.

Studies were also conducted to compare the activation of the
β-catenin/TCF signaling pathway in SW480R and SW480A cells. The SW480R cells
had slightly more active β-catenin and displayed a two-fold increase in the activation
of the TCF-responsive promoter TopFLASH; however, both cell lines clearly
displayed nuclear localization of β-catenin, as previously reported (Palmer, H. G. et
al., J Cell Biol 154, 369-87 (2001)). These observations, together with the fact that
abnormal β-catenin/TCF pathway signaling is a feature of the majority of colorectal
cancer cell lines, suggest that β-catenin/TCF activation is necessary but not sufficient
for the induction of Prox-1 expression in colorectal cancer.

DNA methylation is frequently abnormal in colorectal cancer, and it
was reported recently that Prox-1 expression is suppressed in human hematological
cell lines due to hypermethylation of CpG islands in intron 1 of Prox-1 (Nagai, H. et
al., Genes Chromosomes Cancer 38, 13-21 (2003)). Treatment of SW480A cells with
the DNA methyltransferase inhibitor 5-aza-2'-deoxycytidine did not increase
Prox-1 mRNA, although it did increase the expression of TIMP3. In
contrast, 5-aza-2'-deoxycytidine almost completely suppressed Prox-1 expression in
SW480R cells, suggesting that, at least in this cell type, the regulation of Prox-1 by
DNA methylation is opposite to that observed in leukemic cells.

Our finding that DNA demethylation decreases Prox-1 mRNA levels
suggests the existence of a putative suppressor of Prox-1 transcription whose
expression is relieved from silencing upon treatment with 5-aza-2'-deoxycytidine. Since
5-aza-2'-deoxycytidine is used for the treatment of human cancers, our data also suggest
that Prox-1 could be used as a marker to identify colorectal tumors that would
respond favorably to this drug. Such screening of patients/tumors is intended as an
aspect of the invention. The role of DNA methylation in the growth of intestinal
neoplasms was previously demonstrated in mice heterozygous or hypomorphic for
DNA methyltransferase 1, a major enzyme involved in the methylation of DNA.
These mice do not develop intestinal adenomas when crossed with Apc min/+ mice. In
contrast, they develop lymphomas, demonstrating cell type specific effects of
decreased DNA methylation on cancerous growth (Gaudet, F. et al., Science 300,
489-92 (2003); Eads, C. A. et al., Cancer Res 62, 1296-9 (2002)).

EXAMPLE 15

PROX-1 SUPPRESSION AND OVEREXPRESSION IN COLORECTAL
CANCER

To characterize the effects of Prox-1 suppression and overexpression
in colorectal cancer, stable colorectal cancer cell line clones inducibly expressing
Prox-1 or Prox-1 targeting siRNAs are employed. Cells are implanted into laboratory
animals, such as nu/nu mice, and tumor growth is studied in control mice and mice
treated with doxycycline. As an alternative approach, lentiviruses expressing Prox-1
or Prox-1 siRNA are employed to provide long-term expression in colorectal
cancer cell lines in vitro and in vivo.

To inducibly overexpress Prox-1 or express Prox-1 siRNAs,
Prox-1 cDNA was subcloned in the pTetOS vector (Sarao and Dumont, Transgenic
Res., 1998), where it is placed under the control of a doxycycline-regulated promoter.
Prox-1 siRNAs were subcloned in the pTer vector (van de Wetering et al., EMBO
Reports, 2003). Colorectal carcinoma cells stably expressing the tTA activator may be
transfected with Prox-1/TetOS or Prox-1 siRNA/pTer vectors. Clones may be
selected in the presence of blasticidin and G418 and further tested for the expression
of Prox-1 by immunostaining, or of Prox-1 siRNA by suppression of co-transfected
Prox-1 in the presence of doxycycline. For production of Prox-1 lentiviruses, Prox-1
cDNA was subcloned into FUiresGFPW (Lois et al., Science, 2002). For production
of Prox-1 siRNA lentiviruses, Prox-1 siRNAs 1 and 2 were subcloned into the lentiviral
vector pLL3.7 (Rubinson et al., Nat Genet, 2003).

Sequences of the DNA oligos used in the cloning of pLL3.7-Prox-1:

sense:
TGGTCATCTGCAAGCTGGATTTCAAGAGAATCCAGCTTGCAGATGACCTTTTTC (SEQ ID NO 47).

antisense:
TCGAGAAAAAAGGTCATCTGCAAGCTGGATTCTCTTGAAATCCAGCTTGCAGTGACCA (SEQ ID NO 48).

pLL3.7 PROX1-2: sense:
TGAGCCAGTTTGATATGGATTTCAAGAGAATCCATATCAAACTGGCTCTTTTTTC (SEQ ID NO 49).

antisense:
TCGAGAAAAAAGAGCCAGTTTGATATGGATTCTCTTGAAATCCATATCAAACTGCTCA (SEQ ID NO 50).
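
The layout of such hairpin oligos can be illustrated with a short sketch. It assumes,
from inspection of the sequences above rather than from any statement in the text, that
a sense oligo consists of a leading T, a 19-nucleotide target, the loop TTCAAGAGA, the
reverse complement of the target, and a TTTTTTC tail, and that the antisense oligo is
the reverse complement of the sense oligo preceded by a TCGA overhang. The function
names are hypothetical, and the printed sequences remain authoritative.

```python
# Illustrative sketch only: the oligo layout below is an assumption inferred
# from the listed sequences; function names are hypothetical.

LOOP = "TTCAAGAGA"          # loop seen between the two arms of the hairpin
TERMINATOR = "TTTTTTC"      # T-run tail seen at the 3' end of the sense oligo
OVERHANG = "TCGA"           # 5' overhang seen on the antisense oligo

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    seq = seq.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence may contain only A, C, G and T")
    return seq.translate(_COMPLEMENT)[::-1]


def hairpin_oligos(target: str) -> tuple[str, str]:
    """Build (sense, antisense) cloning oligos for one target sequence."""
    target = target.upper()
    # sense strand: T + target arm + loop + complementary arm + terminator
    sense = "T" + target + LOOP + reverse_complement(target) + TERMINATOR
    # antisense strand: overhang + reverse complement of the whole sense oligo
    antisense = OVERHANG + reverse_complement(sense)
    return sense, antisense


if __name__ == "__main__":
    sense, antisense = hairpin_oligos("GAGCCAGTTTGATATGGAT")
    print("sense:    ", sense)
    print("antisense:", antisense)
```

Called with the target GAGCCAGTTTGATATGGAT, the sketch returns the sense oligo listed
above for pLL3.7 PROX1-2.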

Inducible expression of Prox-1 targeting short hairpin RNA ("shRNA")
may also be achieved via a CRE recombinase activated induction system, whereby an
inactivating stuffer DNA sequence flanked by modified loxP sites is removed
from an shRNA expression cassette by CRE recombinase activity, thus activating
shRNA expression. Alternatively, a similar system may be used to inactivate
shRNA expression upon introduction of CRE recombinase. These systems have been
described (Tiscornia et al., PNAS 2004; Coumoul et al., NAR 2004).

shRNA, or "short hairpin RNA", is a short sequence of RNA that
makes a tight hairpin turn and can be used to silence gene expression. This small
hairpin RNA was first used in a lentiviral vector (Abbas-Terki, T. et al., Hum. Gene
Ther. 13(18):2197-201 (2002)). shRNA generates siRNA in cells (An, D. S. et al., Hum.
Gene Ther. 14(12):1207-12 (2003)).

To study the effects of Prox-1 overexpression in vivo, transgenic mice
overexpressing Prox-1 under the control of an intestinal-specific promoter, such as villin,
Cyp1A or FABPi, are created using standard techniques. The proliferation and
differentiation status of intestinal epithelial cells is studied by staining of intestinal
tissues for PCNA, Ki67, CDKN1A, mucins, lysozyme, chromogranin A and
carboxypeptidases II and IV. The crossing of Prox-1 transgenic animals with Apc min/+
mice permits determination of whether Prox-1 overexpression influences the number
and size of intestinal polyps in this mouse model of colorectal cancer.

Specifically, for in vivo studies of Prox-1 in intestinal differentiation,
Prox-1 cDNA was subcloned in the p12.4Vill plasmid, which places it under the control
of the 12.4 kb mouse villin promoter (Madison et al., J. Biol. Chem. 2002, genomic contig
NT_039170). The construct may be used for the production of villin-Prox-1
transgenic mice, which will overexpress Prox-1 at the sites of villin expression, i.e.
intestinal epithelial cells. Also contemplated is subcloning Prox-1 cDNA into the
vector z/AP (Lobe et al., Dev. Biol., 1999), to be able to conditionally express Prox-1 in
any given tissue. In this approach Prox-1 cDNA is placed between the loxP sites, and
it is not expressed until Cre recombinase is present in the same cell. Excision of loxP
sites places the transgene under the control of the chicken β-actin promoter. To achieve
intestinal specific overexpression of Prox-1, the transgenic animals containing
z/AP-Prox-1 expression cassettes in their genomes may be crossed with villin-Cre mice
(Madison et al., J. Biol. Chem. 2002). The latter approach may be preferable to the
villin-PROX1 overexpression because of potentially higher expression levels of the
transgene. Also contemplated is cloning Prox-1 cDNA under the control of the rat Fabpi
promoter (Rottman and Gordon, J. Biol. Chem., 1993, genomic contig NW_047627)
or the Cyp1A promoter (Sansom et al., Genes Dev., 2004, genomic contig NT_039474).
The latter promoter has the advantage of being inducible upon administration of
β-naphthoflavone. All of these transgenic mice are contemplated as aspects of the
invention.

EXAMPLE 16 
DOMINANT NEGATIVE MUTANTS OF PROX-1 

Further contemplated herein are dominant negative mutants of Prox-1.
Specifically, a Prox-1 mutant protein lacking the transactivation domains or DNA
binding domains may act in a dominant negative manner. Experiments to investigate
this hypothesis may be conducted by producing a truncated form of Prox-1 lacking
the last 60 amino acids or the first 575 amino acids. Disruption of the DNA binding
domain entails truncation of the protein to exclude amino acids 572-634 of SEQ ID
NO. 3, based on homology to Prospero (Drosophila). Disruption of the
transactivation domain entails the deletion of amino acids 635-737. These proteins
may then be tested for their ability to repress the induction of Prox-1 target genes
upon co-transfection with the wt Prox-1. If such an effect is observed, the construct
can be used for the generation of transgenic animals with the purpose of suppression
of Prox-1 effects in vivo, or for anti-Prox-1 therapies in colorectal cancer.
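
The residue ranges given above can be turned into candidate deletion constructs with
a few lines of code. The following is a minimal sketch, assuming 1-based inclusive
residue numbering as used in the text; the function and dictionary names are
hypothetical, and the placeholder protein string stands in for SEQ ID NO. 3, which is
not reproduced here.

```python
# Illustrative sketch only: names are hypothetical and the input below is a
# placeholder string, not the actual Prox-1 protein sequence.

# Residue ranges quoted in the text (1-based, inclusive).
DOMAIN_RANGES = {
    "dna_binding": (572, 634),
    "transactivation": (635, 737),
}


def delete_residues(protein: str, first: int, last: int) -> str:
    """Return the protein with residues first..last (1-based, inclusive) removed."""
    if not 1 <= first <= last <= len(protein):
        raise ValueError("residue range lies outside the protein")
    # Python slices are 0-based and end-exclusive, hence first - 1 and last.
    return protein[: first - 1] + protein[last:]


def deletion_mutants(protein: str) -> dict[str, str]:
    """Build one deletion mutant per annotated domain range."""
    return {
        name: delete_residues(protein, first, last)
        for name, (first, last) in DOMAIN_RANGES.items()
    }


if __name__ == "__main__":
    # Placeholder of length 737, matching the highest coordinate quoted above.
    placeholder = "M" + "A" * 736
    for name, mutant in deletion_mutants(placeholder).items():
        print(name, len(mutant))
```

Whether any such construct actually behaves as a dominant negative would still have to
be established by the co-transfection experiments described above.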

The foregoing examples are intended to be illustrative of the invention 
and not intended to limit the claims which define the invention. All patent, journal, 
and other literature cited herein is incorporated herein by reference in the entirety. 

While the invention is described specifically with respect to Prox-1,
there are other genes described in tables herein that are differentially expressed. All
materials and methods described herein are applicable to the genes described in the
tables.

Claims:

1. A method of screening colon tissue for a pathological
condition, said method comprising:

measuring Prox-1 expression in a biological sample that comprises
colon tissue from a mammalian subject, wherein elevated Prox-1 expression in the
colon tissue correlates with a pathological phenotype.

2. A method according to claim 1, comprising comparing Prox-1
expression in the colon tissue to Prox-1 expression in healthy colon tissue, wherein
increased Prox-1 expression in the colon tissue from the mammalian subject
correlates with a pathological phenotype.

3. A method according to claim 1 or 2, further comprising a step,
prior to said measuring step, of obtaining the biological sample comprising colon
tissue from a mammalian subject.

4. The method according to any one of claims 1-3, wherein the
pathological condition is colon cancer, and wherein increased Prox-1 expression in
the colon tissue is indicative of a cancerous or precancerous condition.



5. The method according to any one of claims 1-4, wherein the
measuring comprises measuring Prox-1 protein in the biological sample.

6. The method of claim 5, wherein the measuring comprises
contacting the colon tissue with a Prox-1 antibody or antigen-binding fragment
thereof.



7. The method of any one of claims 1-6, wherein the measuring
comprises measuring Prox-1 mRNA in the colon tissue.

8. The method of claim 7, wherein the measuring comprises in
situ hybridization to measure Prox-1 mRNA in the colon sample.

9. The method of claim 7, wherein the measuring comprises steps
of isolating mRNA from the colon tissue and measuring Prox-1 mRNA in the isolated
mRNA.

10. The method according to any one of claims 1-9, wherein the
measuring comprises quantitative polymerase chain reaction (PCR) to quantify Prox-1
mRNA in the colon tissue relative to Prox-1 mRNA in healthy colon tissue.

11. A method according to any one of claims 1-10, further
comprising measuring expression of at least one gene selected from the group
consisting of CD44, ENC1, and ID2 in the colon tissue, wherein elevated Prox-1
expression and elevated expression of the at least one gene in the colon tissue
correlate with a pathological phenotype.

12. A method according to any one of claims 1-11, further
comprising measuring activation of the β-catenin/TCF pathway in the colon tissue,
wherein activation of the β-catenin/TCF pathway and elevated Prox-1 expression in
the colon tissue correlate with a pathological phenotype.

13. A method according to claim 12, wherein activation of the
β-catenin/TCF pathway is measured by at least one indicator in the colon tissue selected
from the group consisting of: mutations in an APC gene; mutations in a β-catenin
gene; and nuclear localization of β-catenin.

14. The method according to any one of claims 1-13, wherein the
mammalian subject is a human.



15. A method according to claim 14, further comprising a step of
administering to a human subject identified as having a pathological condition
characterized by increased Prox-1 expression in colon tissue a composition
comprising a Prox-1 inhibitor.

16. Use of a molecule that suppresses expression or activity of
Prox-1 in the manufacture of a medicament for the treatment of colorectal cancer.

17. A method of inhibiting the growth of colorectal cancer cells in
a mammalian subject comprising the step of:

administering to the subject a composition comprising a molecule that
suppresses expression or activity of Prox-1, thereby inhibiting the growth of colon
carcinoma cells.

18. A method or use according to claim 16 or 17, wherein the
molecule suppresses Prox-1 expression.

19. A method or use according to any one of claims 16-18,
comprising a step of identifying a mammalian subject with a colon cancer
characterized by increased Prox-1 expression, wherein the composition is
administered after the identifying step.

20. A method or use according to any one of claims 16-19, wherein
the cancer is selected from a colorectal adenoma and a colorectal carcinoma.

21. The method or use according to any one of claims 16-20,
wherein the composition further comprises a pharmaceutically acceptable diluent,
adjuvant, or carrier medium.

22. The method or use according to any one of claims 16-21,
wherein the molecule comprises an antisense oligonucleotide that inhibits Prox-1
expression.

23. The method or use according to any one of claims 16-21,
wherein the molecule comprises micro-RNA that inhibits Prox-1 expression.

24. The method or use according to any one of claims 16-21,
wherein the molecule comprises short interfering RNA (siRNA) that inhibits Prox-1
expression.

25. The method or use of claim 24, wherein the siRNA comprises
at least one nucleotide sequence set forth in SEQ ID NOS: 4, 5, 6, and 7.

26. The method or use according to any one of claims 16-21,
wherein the molecule comprises a zinc finger protein that inhibits Prox-1 expression.

27. The method or use according to any one of claims 16-21,
wherein the molecule comprises a dominant negative form of Prox-1 protein, or an
expression vector containing a nucleotide sequence encoding the dominant negative
Prox-1 protein.



28. The method or use of claim 27, wherein the dominant negative
form of Prox-1 protein has a disrupted DNA binding domain.

29. The method or use of claim 27, wherein the dominant negative
form of Prox-1 protein has a disrupted transactivation domain.

30. The method or use according to any one of claims 16-21,
wherein the molecule comprises short hairpin RNA (shRNA) that inhibits Prox-1
expression.

31. The method according to any one of claims 17-30, wherein the
composition is administered in an amount effective to suppress Prox-1 expression and
increase Notch 1 signaling.

32. The use according to any one of claims 16-30, wherein the
molecule is present in the composition in an amount effective to suppress Prox-1
expression and increase Notch-1 signaling.

33. The method according to any one of claims 17-31, wherein the
composition is administered in an amount effective to increase 15-PGDH activity or
decrease prostaglandin D2 synthase activity.

34. The method according to any one of claims 17-31, further
comprising administering to the subject an inhibitor of the β-catenin/TCF signaling
pathway.



35. The use according to any one of claims 16-30, wherein the
medicament further includes an inhibitor of the β-catenin/TCF signaling pathway.

36. The method or use of claim 34 or 35, wherein the inhibitor of
the β-catenin/TCF signaling pathway is a dominant negative form of TCF-4.

37. The method or use of claim 34 or 35, wherein the inhibitor of
the β-catenin/TCF signaling pathway targets TCF-4, β-catenin, or c-myc.

38. The method according to any one of claims 17-31, further
comprising administering to the subject a COX-2 inhibitor.

39. The use according to any one of claims 16-30, wherein the
medicament further includes a COX-2 inhibitor.

40. The method or use of claim 38 or 39, wherein the COX-2
inhibitor is a non-steroid anti-inflammatory drug.

41. The method according to any one of claims 17-31, further
comprising administering to the subject a Notch signaling pathway agonist.

42. The use according to any one of claims 16-30, wherein the
medicament further includes a Notch signaling pathway antagonist.

43. The method or use according to claim 41 or 42, wherein the
Notch signaling pathway agonist is a Notch ligand.


44. The method or use of claim 43, wherein the Notch ligand is
Jagged1, Jagged2, Delta1, Delta3, Delta4, or Serrate.

45. The method or use of claim 41 or 42, wherein the Notch
signaling pathway agonists are Notch targets Hey1, Hey2, or Hes1.

46. A method of inhibiting Prox-1 function in a mammalian subject
having a disease characterized by Prox-1 overexpression in cells, comprising the step
of administering to said mammalian subject a composition, said composition
comprising a compound effective to inhibit Prox-1 function in cells.

47. Use of an inhibitor of Prox-1 function in mammalian cells for
the manufacture of a medicament for inhibiting Prox-1 function.

48. A method of screening for a Prox-1 modulator, comprising
steps of:

contacting a test molecule with Prox-1 protein, or a nucleic acid
comprising a nucleotide sequence that encodes Prox-1 protein, under conditions
which permit the interaction of the test molecule with the Prox-1 protein or nucleic
acid;

and measuring interaction between the test molecule and Prox-1
protein or nucleic acid, wherein a test molecule that binds the Prox-1 protein or
nucleic acid is identified as a Prox-1 modulator.

49. The method of claim 48, wherein the test molecule comprises a
protein, a carbohydrate, a lipid, or a nucleic acid.

50. The method of claim 48, wherein the test molecule comprises a
member of a chemical library.

51. The method of any one of claims 48-50, comprising measuring
the binding between the test molecule and the DNA binding domain of Prox-1.

52. A method of screening for modulators of binding between a
DNA and Prox-1 protein comprising steps of:

a) contacting a DNA with a Prox-1 protein in the presence and in the
absence of a putative modulator compound;

b) detecting binding between the DNA and the Prox-1 protein in the
presence and absence of the putative modulator compound; and

c) identifying a modulator compound based on a decrease or increase
in binding between the DNA and the Prox-1 protein in the presence of the putative
modulator compound, as compared to binding in the absence of the putative
modulator compound.

53. A method of screening for modulators of binding between a
DNA and Prox-1 protein comprising steps of:

a) contacting a DNA with a Prox-1 protein in the presence and in the
absence of a putative modulator compound;

b) detecting binding between the DNA and the Prox-1 protein in the
presence and absence of the putative modulator compound; and

c) identifying a modulator compound based on a decrease or increase
in differentiation in the presence of the putative modulator compound, as compared to
differentiation in the absence of the putative modulator compound.

54. A method according to any one of claims 48-53, further
comprising steps of:

contacting a cell from a colorectal cancer or colorectal cancer cell line
with the Prox-1 modulator; and

selecting a Prox-1 modulator that inhibits growth of the cell.

55. A method according to claim 54, further comprising:

formulating a composition comprising the selected Prox-1 modulator
and a pharmaceutically acceptable carrier;

administering the composition to a mammalian subject having a
colorectal cancer; and

monitoring the mammalian subject for growth, metastasis, shrinkage,
or disappearance of the colorectal cancer.

56. A small interfering RNA (siRNA) molecule that comprises a
sense region and an antisense region, wherein said antisense region comprises
sequence complementary to a nucleotide sequence encoding Prox-1 set forth as SEQ
ID NO: 2, or a fragment thereof, and wherein the sense region comprises sequence
complementary to the antisense region, or a fragment thereof.

57. The siRNA molecule of claim 56, wherein said siRNA
molecule comprises two nucleic acid fragments, wherein one fragment comprises the
sense region and the second fragment comprises the antisense region.

58. The siRNA molecule of claim 57, wherein said sense region
comprises a 3'-terminal overhang relative to the antisense region.

59. The siRNA molecule of claim 57 or 58, wherein the antisense
region comprises a 3'-terminal overhang relative to the sense region.

60. The siRNA molecule of claim 59, wherein said 3'-terminal
overhangs each comprise 1-5 nucleotides.

61. The siRNA molecule of claim 59, wherein said antisense region
3'-terminal nucleotide overhang is complementary to RNA encoding Prox-1.



WO 2005/014854 



PCT/EP2004/008819 






62. The siRNA molecule according to any one of claims 56-61,
wherein said complementary sequences are 18-100 nucleotides in length.

63. The siRNA molecule according to any one of claims 56-61,
wherein said complementary sequences are 18-30 nucleotides in length.

64. The siRNA molecule according to any one of claims 56-61,
wherein said complementary sequences are 21-23 nucleotides in length.

65. The siRNA molecule according to any one of claims 56-61,
wherein said antisense region comprises sequence complementary to sequence having
any of SEQ ID NOs. 4 and 6.

66. The siRNA molecule according to any one of claims 56-61,
wherein said antisense region comprises sequence having any of SEQ ID NOs. 5 and
7.

67. The use of an siRNA molecule according to any one of claims
56-66 in the manufacture of a medicament for the treatment of colorectal cancer.

68. The use according to claim 16, wherein the molecule comprises
a compound comprising a nucleic acid 8 to 50 nucleotides in length, wherein said
compound specifically hybridizes with a polynucleotide encoding Prox-1, or
hybridizes to the complement of the polynucleotide, and inhibits the expression of
Prox-1 when introduced into a cell that expresses Prox-1.

69. The use of claim 68, wherein the compound is an antisense
oligonucleotide.

70. The use of claim 69, wherein the antisense oligonucleotide has
a sequence complementary to a fragment of SEQ ID NO: 1.

71. The use of claim 70, wherein the fragment of SEQ ID NO: 1
comprises a promoter or other control region, an exon, an intron, or an exon-intron
boundary.

72. The use of claim 70, wherein the fragment of SEQ ID NO: 1
comprises an exon-intron splice junction.

73. The use of claim 70, wherein the fragment of SEQ ID NO: 1
comprises a region within 50-200 bases of an exon-intron splice junction.

74. The method or use according to any one of claims 16-21,
wherein the molecule comprises an inhibitor of DNA methyltransferases, thereby
inhibiting Prox-1 expression.

75. The method or use according to claim 74, wherein the inhibitor
of DNA methyltransferases is 5-aza-2'-deoxycytidine.

76. The method according to any one of claims 22-31, further
comprising administering to the subject an inhibitor of DNA methyltransferases.


77. The use according to any one of claims 22-30, wherein the
medicament further includes an inhibitor of DNA methyltransferases.

78. The method or use of claim 76 or 77, wherein the inhibitor of
DNA methyltransferases is 5-aza-2'-deoxycytidine.

accttacact 5160
tgaaacactt 5220
ctccccgaaa 5280
agtgttcctt 5340
tgaaattctt tactgaaaat [illegible] [illegible] tcatcagttt ccttagaatt 5400
ttgcatgtta aagagaatgt cagataattc agatattaaa ggagactctt ttggagtagt 5460
taaaacctgt tttgattata cctggatgtt tattcttcta atatcttttt ctgggaggaa 5520
tctgctatgt taagatatgc attgtataag aattactaaa gcatttgtgt aggttatata 5580
cgaagtgatg caacaaaata tttaatgatg aaaaactcta tatagacttt cacattaatt 5640
aaagaggggt ttacaggaat agagtaagtg tatccgatca ataatacatt tgggttcaaa 5700
ttctcatcag tatttttctg catccttgct gatttggaca tccaccagtg ttgatcaaaa 5760
gcttcatatt gcctagtgaa actgaaaatt aatgttaaaa tgcaaatatg atatgcatca 5820
ataataattg caggtgaaac atgatagctt aatacatatc ttgagaaata aaggagttta 5880
aaaaatatca atgataaagt cattccatgg cttcctttaa attctgaact ggaatatcat 5940
ggaagcactt gggaaatgtt tttaagagat ttaatttata ttatggtaac gtaacagtac 6000
attttcttat gtggtaaata tattcatata gatatcttgt ttatgaaatg tgatgctaat 6060
aaagtgctgt gtcaaccggt tattattatt taatcatgcc tatagcttcc atgggttatg 6120
gttccagtgt gtgctaccac [illegible] [illegible] aaatctaagc tatatggaga 6180
gatatattta tttgtgccta ttaatataat gccttgtcct [illegible] atttatctta 6240
tttttcccat ttgttttgtc ttatttgtta tgttccagct [illegible] caacaagacc 6300
taaaagtatt taaattcttt tagcccaaga cagatacaaa tcgttattta [illegible] 6360
gttgactgaa atagaattac aaaattagtt tagtttggtg aatatcaagg gagttatatc 6420
ttgttcttaa cagactccac aagcatttct ttccacctta ggaagagcac agccctcctc 6480
ttggctccag catggggcag ggatgcagct gttgatacct aggctagatg [illegible] 6540
cagttgacgc agaggtaaat ggcagttgga aaaggaagga tgcctgggga [illegible] 6600
ctcatcagcg acaccagtct gtcctttcca agcctctgtg [illegible] tcttcccaca 6660
gcaaggatgg caggaggaaa gtccagtttg ggtgttaggg [illegible] gagaaaaaat 6720
actgcaaaaa gtttgtttga cattttgatt ggagatccat gtgctttgca [illegible] 6780
aagagaaaag gatttgcata caaatagaaa agatgtaaaa tttaaaaata [illegible] 6840
gctctatttt ggggaaggtg atatacacac agaaaaaagt cttccttgta [illegible] 6900
atgcaagtgt ttctttgatt aacagagctt tgaaatgatt catccttttt cttgtctcag 6960
cctctccttg ttctttctgt [illegible] [illegible] ttatcagatc taatgtgttt 7020
gtgtagtatt tgtcactgca tttttgtatt cctgaaacca attttattat tagtgtttga 7080
aagggtctca atcattctga attcaatttt gaacccaatg ttgtagttct tgagaactcc 7140
atctccattc taagttcagg aaattttatc ctgaagcatg caaaaagtat ttcattctca 7200
agcatgcaaa tatatatata tatatatata tatatatata tatatatata tatatatata 7260
taaagaggta tcattttgct ttcatgatac cctaaagcag gctcttttaa aatgttttat 7320
ctttctatag aaaccaggag caaagatttc atgaggaaat cactgtcact taaaaaaata 7380
tacatattgt tgccatctaa gcattgagca ttttcttgat [illegible] [illegible] 7440
tgaaattatg cctatttgca tggatagtca ttctttaaag ctagccacag atgcagtcct 7500
agggagcacg tagatgtttt tacaggtgaa [illegible] tgggagccgt tccagacact 7560
ctgcatgctg cctttggcaa tggaccctgt [illegible] [illegible] ttaagcaaac 7620
gtgaagttta atattagata aacccaacgt gaaaaaaatt [illegible] tcataaaatg 7680
ttaattataa acaaaaagat gtgacatctt atatgtctac [illegible] attagcatca 7740
ctagttaata agttacacaa tgtcaagtgc cttttatgaa [illegible] ggatgttctc 7800
tttttatact gtgtttccaa gaaacaatgg [illegible] [illegible] atttcccttt 7860
ctcacacatt tgatggacat tattttcttt cttctttata tatcttcttt cagttttttc 7920
tgtttttttt [illegible] [illegible] gaaataaggt [illegible] tgtatgttaa 7980
agagtttctt [illegible] acatattatt [illegible] [illegible] [illegible] 8040
ctgaagatat [illegible] [illegible] ataacttatt attcctcata ttgatttcta 8100
aaatgtggta [illegible] [illegible] atccaacttt gtaataattt gtcattaaaa 8160
ggacattatg [illegible] [illegible] tagttacatt aagatatatc aacagatatc 8220
atcttcacct [illegible] [illegible] tgcatagcta agctaaataa gcagacttat 8280
aaaatgacta ttgtgcattt [illegible] [illegible] atttatgttt gaaagatgct 8340
[illegible] gtgttctcct [illegible] catatgacaa aaatattgta aaattcaaga 8400
[illegible] [illegible] [illegible] attttagagt ccttttcctt taatagcgtt 8460
[illegible] [illegible] [illegible] aatttttcat gtgaatttct catagtctca 8520
[illegible] [illegible] [illegible] agtttcatct tccttataat atgaatctca 8580
atgcccaaaa [illegible] [illegible] [illegible] cttataatct actgtttctt 8640
[illegible] [illegible] [illegible] [illegible] aatcatatta ctggtgctta 8700
[illegible] gcttaatttt tttttaatct taatccatct taaaccagtg gagcagaaat 8760
[illegible] gtttcatttc aagcagagtg cataataaat tgcaataatt gtaatgtgcc 8820


3"t333"tCCC3 


gagcctatgc attttgcatt 


tgattcagga 


ttgaggtcag 


gaaatttgga 


8880 


g333"t"t"ta33 


gaaaatgatt catcagtcct 


tttgttctgt 


tggccagggt 


cccgggattc 


8940 


t "t 9 3- 9 c "t- 9 9 


cccagctgac gagcttttga 


agatggcaca 


ataaccgtcc 


agtgatgcct 


9000 


93CC3"t93C3 


gcacagccct cttaagccgg 


caaaccaaga 


ggagaagagt 


tgacattgga 


9060 


9"t93333993 


cggtagggac agcatctgca 


ttttttgcta 


aggcaagagc 


aacgtttttt 


9120 


agtgccatga atccccaagg ttctgagcag gatgttgagt attcagtggt gcagcatgca 9180


gatggggaaa 


agtcaaatgt actccgcaag 


ctg ctgaaga 


999^gaactc 


gtatgaagat 


9240 


gccatgatgc cttttccagg agcaaccata atttcccagc tgttgaaaaa taacatgaac 9300

aaaaatggtg gcacggagcc cagtttccaa gccagcggtc tctctagtac aggctccgaa 9360

gtacatcagg aggatatatg cagcaactct tcaagagaca gccccccaga gtgtctttcc 9420
ccttttggca ggcctactat gagccagttt gatatggatc gcttatgtga tgagcacctg 9480


agagcaaagc 


gcgcccgggt tgagaatata 


atteggggta 


tgagecatte 


ccccagtgtg 


9540 


9 catt 3.3.999 


gcaatgaaaa tgaaagagag 


atggccccgc 




tccccgagaa 


9600 


ag "t "t 3. c a.9 3. 9 




ctt ccccagc 


agcaglaaca 


gagtttccag 


9660 


r a n r 1~ n n T T T 


cagcccgaaa agaacagaag 


y « y a y y a y \_ 


gccgacagct 


gaaacagcag 


9720 


ctggaggaca tgcagaaaca gctgcgccag ctgcaggaaa agttctacca aatctatgac 9780


agcactgatt 


cggaaaatga tgaagatggt 


aacctgt ctg 


aagacagcat 


gegcteggag 


9840 


atcct ggatg 


ccagggccca ggactctgtc 


993-3-ggt cag 


ataatgagat 


gtgegagcta 


9900 


gacccaggac agtttattga ccgagctcga gccctgatca gagagcagga aatggctgaa 9960


3. 3. C 3. 3. 9 C C 9 3. 


agcgagaagg caacaacaaa 


gaaagagacc 


atgggccaaa 


ctccttacaa 


10020 


ccggaaggca 






tgaacactgc 


catgtcgcaa 


10080 




ctgtggtcaa agtcttttcg 


gccaagccct 


cccgccaggt 


tcctcaggtc 


10140 


ttcccacctc 


tec ag ate cc ccaggccaga 


tttgeagtea 


atggggaaaa 


ccacaatttc 


10200 


cacaccgcca 


accagcgcct gcagtgcttt 


ggcgacgtxa 


tcattccgaa 


ccccctggac 


10260 


acctttggca atgtgcagat ggccagttcc actgaccaga cagaagcact gcccctggtt 10320


gtccgcaaaa 


actcctctga ccagtctgcc 


tccggccctg 


rcactaacaa 


ccaccaccag 


10380 


cccctgcacc 


agtcgcctct ctctgccacc 


aegggcttea 


ccacgtccac 


cttccgccac 


10440 


CCC"t"tCCCCC 




tttcagagee 


cattaggtgc 


tccctccggc 


10500 


tccttctctg gaaaagacag agcctctcct gaatccttag acttaactag ggataccacg 10560


agtctgagga 


ccaagatgtc atctcaccac 




acccttgttc 


accagcacac 


10620 


ccgcccagca 


ccgccgaagg gctctccttg 


tegctcataa 


agtccgagtg 


eggegatett 


10680 


caagatatgt 




ggaagtgcaa 


tatcctttta 


ttttcccctc 


10740 


gaggaaaaaa caaaccaaaa aaggtttccc aaaaggttgg gtttacacaa tatctagagt 10800


aatgtagatt 




acctttccca 


ttattcaaag 


gaataggctt 


10860 


ttatcagcat 


gcgtg tgeca ttcctgattg 


cagaaaagct 


taaaactaag 


ccaacatctt 


10920 


tgcagcttcc 


acaagttgtt cactgccttg 


aggagctcct 


atttaatatg 


tgetttctea 


10980 


gcagtgtttt ttttctgctg ttcttcctgc attatcttct tatccctatc tcttaaaaaa 11040


aataaagaag 




cagtctcatt 


gtaaatactg 


attgaattct 


11100 


ctcagatatt ttttaaagat ggtaagttta atagaataag gagaaaagtc agttttcaga 11160


tccctaagat 


cc cat aagaa gaattctcag 


tgtaaaccat 


ctgeaagget 


tctggtccgt 


11220 


ttaaagacag 


cccgatgaaa tcttaggaag 


agegctttae 


aaataaaaaq 


ttgaggagga 


11280 


agaaaaatgg atgtgggtgg ggagttagtc tctctttcat ctttaagtga gacttttttt 11340

tttaaggaaa tatacaggta ctgatttatt cagacagcat cggtctctct cccgttcacc 11400

caaggtctgt tctttgggtc tggtgcagct gcctctatgc atgattaacc tctgttcagc 11460
Ccl "t 3. C 3. C 39 3.


aatcttttgt 



ttattttgga aagcgagaga 


11520 


g c a. c a. a. t "t a. a. 


atataaaact 


cagctgtatt cgacttaaaa 


atggctcttt ttatgattct 


11580 


tttaa.a.t"tct 


gaaactgacg 


tttatgtaga gataacagtt 


atattttttt 


attaggccta 


11640 


tec eg 3. a. etc 


cagctatttt 


taactgaaga tttttttttc 


tctctgtata tcggttcttt 


11700 


ctgta.a.a.ttt 


tttaaaaatc 


1"1"n1-nri1"rri1" 1-nn1"r1"1"1"1"n 


ggagtagtaa aatagtagca 


11760 


tttgggggca. 


ggtggaggca 




gatggatata aaatttagca 


11820 


a. t t a. a. g t t g g 


ctgtgactaa 


atttaggatt ttgagcaatt 


gtcttgatga ctagagattg 


11880 


a.catttt ca.t 






agtgcaaagt 


cccagctatt 


11940 


9 9 *t 9 9 ^ ^ *^ 3- 


tgttttcctg 


gttagtggag gtcgtacttc 


aagccacctc tcaggataat 


12000 


3- 9 "t g "t a. g a.t t 


tctgataggg 


tgaactacta gggccctaat 


catgagtcct 


gcttgggcag 


12060 




agtctctctt 




attgtaacag 


aaagggaaga 


12120 


yddctya i_y i_y 


ggagatttct 


acatatacgt agaaatggag 


ttttagcttg 


gttgttgatt 


12180 






tctaaaattc aatccaccag 


ccatgaatca 


aagctgcacc 


12240 


a. a. g c a. c c a. "t g 


cc tt a cat at 


tataagcagg cagtaaatat 


tgatcaaatg 


attggaatat 


12300 


rn r Tn t* 1" nn t" 

*-y L y L L yy L 


gatgagaaag 


gcaaagtaag aagacacaat 


ggcttgaatg 


gtttttgtgc 


12360 




aagagcatct 


tcagaggttc atgtaaggct 


aatgtctagg 


gctaagaccc 


12420 


cattgcaccc 




ttaacttcat tttgaaccag 


gtagttgtga 


tagtgggttc 


12480 


tttctgtctc tctctctctc ttacacacac acacacacac acacagacac acacacacag 12540


agtaaagtga 




aattttggtg aatatttaaa 


gatttaatgc 


caggtttcaa 


12600 


aactcctgta agtccacact aagctcttta gttcaagatg ccagtttatg gtttttcttt 12660


aaattagact 


ttt ca.tta.ta. 


accagatcat tataattatg 


gctgtgcttt 


ttgtttttag 


12720 


tcttcta.gga. 




ttagattgct ttaagtgttg 


gctatgttca 


ttgtctcaac 


12780 




ccccgga.gga 


attttgagga tttgaattga 


aataagttcc 


ttttattttg 


12840 




aaggctttaa 


agaaaatata gttgcttctt 


cttcagaggc 


atgacttctc 


12900 


ctttcttcta. 


"t ca.ct c a.t a. a. c 




tgttgggaaa 


cacccgtgtt 


12960 


c a. t g "t g a. a. a. "t 




cacactcaga attgtttctt 


tcatatagct 


aaataatgtc 


13020 




ggcHttagt 


gattacattt tccacctttt 


ggccttctat 


gctcctattc 


13080 


■f-t~-t--f-r~r~rrrr 
LL L LLCLLLL 


tctactatta 


atacattgca cttttaacca 


tttatctcat 


tggtatatta 


13140 


tttctcagga agagtaagat aggcaaacaa ccttttctat agttcccaca attctgaaac 13200


cagtgaggat 




gtagagagat tgggcccact 


tttctcctgt 


ctctacctct 


13260 


gtatggcagt gtgttcttcc cttgatttaa ctgttagtgt gtaggcaaaa ttctcaagct 13320

tttactttga agaaatatct gggaatcaca gtgagtgatg tcttacttca attttaggga 13380

tacggggcca tatatgatcc ggttgtacag ttattcctcg aaaagatcaa tagaaatggg 13440

cagaaatgta atgaaatggt acaactgtga ttgctattat tatgttttaa tttttcgttc 13500
atggctttcc aaactgttat atataattta atttttcagg aaaaattatc tcccactcca 13560

aaaggtacca tctgtttttt gaacaaagta gctaagataa gaactattaa gaacaccagc 13620 

ttatcaggtc aacccattct acattcacca cattaaacat atatgttctg taggatagaa 13680 

cacactacct cattatccca tctagtagaa gggaaatagt gaatgtgtat gcaagttaaa 13740 

ctgaatttca gtgcacctgc tccaagggct catgtcttgg attttaaaaa tatgttcagt 13800 

atctttgcaa atgaatctgt ttaatcaaat attaagtttt attcaaattc caaaagaaac 13860 

agtcagccaa ttgcttttct tcatgatgtt ccttgtcatt catcctcttt gcatctcaag 13920 

aaaaatagcc tagtttaggc cccaaacatt tgcatgcacc cagttaaagc acaagaggag 13980 

tagtataagc cgttaagacg tgcaggtgaa gaaattgagc ctgttctctg aaacagccgg 14040 

ctttttctac tcaactttta gggagaatgt tagaaagact tgaagtttag aaaggaaaat 14100 

ggtttagtaa tttgaaatta aaatccaacc aggaaccata gattagaaat gaatttctga 14160 

aatttgaaac catccacaga aattgatctt atacattttt agaagtcttg tggaggctat 14220 

agtacttata ttagctagag caaaacatgt agattaaaga ctaaaagact ttgggctcct 14280 

acactacccc cctcccctga aaaaaattat aaagtaagta aattaaaaaa aaaaaatccc 14340 

tacactacac agccctccga ttatggtgaa cttcctagtg ggagttacga cttgctctat 14400 

cactgtcatt atgtgagaga gtttagatct tttctcccca ttttagtttc tagggggaaa 14460 

acctcttaga aacttagcaa attagggaat aaggcagaac taaaattctt taggtttcaa 14520 

atgttttgga aaatgtaagt agtctcaacc catttgctgg gaactgcagc acgtacaatc 14580 

tctagctaca atccagagtt tagctggaaa aaaagaattt tcttcctccg ctttcacagc 14640 

ttattattct cccatttgcc tttttgctgc ctccgctgct cctcccgtgg ctgctgttta 14700 

ggtaaggtta tattgtactt ggtaaacaga caacacttag gttctcaggt tgtttgaaca 14760 

ctgctttacg ttcagctgca gtaccctgct tctctgatct tttatattcc cgagcagatg 14820 

tctttcatta atttatggat ttatcatctt ttcttttttt tttctttttt cttttttttt 14880 

tttttttaca cctggcagct gtctcaagtt tcaacagtta ttgtctattt tgcattacac 14940 

atagaattga atgtcatctg tcttcacaaa gctatggcta agagaattga ggcacagcca 15000 

catgagctgc tgggacagat cttgtttgcg ttccatcccc cctcacccca ctccccttta 15060 

cctccttaat atttatttgt gctcattttc tttcctggcc ttgaatggag cttagctcgt 15120 

gttcagtaca gctgtatgtt tactgaatct attccatcat gagtcattgt gcgtgtgtaa 15180 

gtatcctgga aacagctagt gctttcttgg aagaacagtt gcttttcagc acaagcactt 15240 

aaaagggaaa ttaaccaatt ggtcagttca gatttatttt gaggagaaaa aaaggattat 15300 

ctaactgttg ccttttaaat gtttcattag ttatttttaa tagtttatta gaaacatata 15360 

ttttatggga attttatctt aattacacaa taagcaagag ataaagatta attctgtgtt 15420 

ccatttcaac tgatcagttc caagtattac caacaggaaa cattttaaag caaaaatgaa 15480 

cttgagaaat ccaaatcaga ataatttttt gttagataaa aagcctctaa atactgatca 15540 
aaataaaatg gatattttac tttttttaga taaaaagaac aaaaacatct tagcataaat 15600


tagatgtatt aaaagcttca ggaagttttg gtagctcagt gcccatctaa gaaacacaga 15660

aaaacacttt gtattttgta tgacaccaaa ttttaaaaga tttgtgactt ccaattaaat 15720

gcatgacgtt gtcttaatgt agccatctga aagaaaagat tagaacccag atctgagagt 15780

gtctgtcaaa gtttggactt gcctaaaact cttatcacaa ggcagtcgca gacagcttgc 15840

aactattatt tcacttatcc atttggacag atggtcctga agtgtgctgg gctcctttag 15900

tcttctgtat cagtctaatg gaggttactg gagggccttt cagccctctc cttggcacaa 15960


gaagtatgtc agtcataaat tatcgtcttt gtaatcatta aggatctcaa acaaaaacac 16020

aagttcagtt aagctgcttt ggcttacaga tataaaatca aaatttcttt ctttagtgtt 16080

tattttcagt ttaacaaaaa ataaaaaaat aaaaaacctg cactacttaa cttttctatt 16140

tacagaccaa ggtgatcttt ttaaaattgc atgggatatt aaagggaatg ttaattgaac 16200

aaattctcag cagaatattt ggttaaacac cctgttataa gtagtcaaga gcttatccat 16260

attaatttga ttatgcttct ctagtaactt tctggtttcc ctccattctt aagattagtc 16320


acgctagact tgatgaaggt catttggaaa attttacctt tcctaaatat ctgtgtttat 16380

ttgacatttc tgcctaaggg gtgaaatttt tgttgggtag ttgtgtgagt gtgtttgtgt 16440

gtgtgtttgc acacacaagc acactttctt ttcttttttt tcttattttt cttagacact 16500

cttctaaaag aaaatcctta gagaagcttc taggaagggc ccttaattga ccttgtgggg 16560

gaccacattg attttctcca cgtgcatctt catttctgat aaattataaa gccattaatt 16620

tgctgaggaa atggcagggc caggctgcgg cacagatgtg accagagcca tcccagctct 16680


gagtctgctg aggagtgcca agaatctggg ggagaatcag gaagcctgga ttgttatggt 16740

tagcctcaca ttctcttggg aactgtttta gttgctgctg tttacagatc taaaaggtaa 16800

tgatgtttcc agataaatag gccttcttat tttgggtaag tggccattta ttgatctgct 16860

aacccacatg tattgatttg ttagccccaa ctactgcgtc actctcaaag gagttaacta 16920

taaatccaag acaggcaaat tgtatttggt tttggaccat tgctttcaca aaagcaacag 16980

ccccctccct gtcctctcca tgccaaaact actcttccca agttttagct attatttaaa 17040


aggaaaaaca attaaaagga tataataaga taaaaagcaa gtgagtcaag atgctccatt 17100

agattaacac taaaaggtaa aatgtgaaac ttgcatagca gtgttcaaaa taatgcattt 17160

tatattttca tgtacattag tagaataatt tgctttaaac tgcagagtgt ggagagaaga 17220

acaaacagaa ctgtaattgc aaggaagaaa aaaaaacctc ttatgacaag agttgtgtag 17280

tacatgttgg gtgcatttgt ctccttagca acaagtgaat gtatagatag cctaccgacc 17340


taaagcaagg aaaatatttt gccatcctca ccctaaagta gccaagattc tgcaactcaa 17400

ttgtgcatcc tcaccattgc atgtggcaac ctctgacagg cgacggtcac tgagcaaatg 17460

gcagcaagtt agcaatggat gccatagcca gtgtcatata ccttccagca ctcccaccgc 17520

agcttgatgg acccccagac tctatggagg tggggactgg agggagggag gtgggagtcc 17580
ttgtgcttac


agaattgctt ttccttaacc 


aattgcatcc

tacatgeagg 




17640 




ttgaaaaaag caaagctcat 


n -t- 1- 1- -f +"1" 1~a t* 


acccgttatc 


^c^gctccla 


17700 


tat 9 c tgaag 


acctacttct ccgacgtaaa 


ggtagggact 


ttttttattc 


ttddLII L\LL 




r a-H-1-*1"f*1"a1" 
LdXt L LL Ld. L 


gcatgtggca gtaatttgaa 


ct cccggaag 


ttaatggaga 


tgaatgtgga 




a. 1 1 9 9 "t- "t a t 


tcctacacct gtgttataat 


tgatttaatg 


cacttgtctt 


tttgtctaaa 




ggtgtgttaa 


gcaaagatgc cacttgtgta 


tt aagat tgg 




ttaat aagtt 


17940 


gcatgggttt ccaatgtagt ctgaaaaact tagcctctgt ctttatatgt ttgagtagct 18000


tctttgaaga 


aatttcagct ggtaatggat 


999^9^gctt 


tagagaatgt 


tttttccct c 


18060 


ccctcagcaa 


cagtaaactg tttctgtttt 


tntttrf ntf 


aalgglgcgt 


tatttgtgct 


18120 


t at 9 aaag c a. 


aactctagca cctctttttc 


cccctgtcga 






18180 


tctctatgca 


gtagctgctt aaaaacaaaa 


gtgatgattg 


tctcttattt 




18240 


i:i:yi:i:y l Lyd 


tgtagagtac actgagcata 


aggagaatga 




agattcagga 


18300 


cacat tatt c 


aaatgaggat atgaaagctg 


tcggcctaca 


gctgclgcct 


ccctcattct 


18360 


acagaatatt gggacctcct ggttctctct gtgtgtgtat gcgtgtgtgt gtgtgtgtgt 18420


gtatgtgt ct 


gtgtctgtgt gtgggtttta 


ag t aatt gtt 




tgatgttgtg 


18480 




gtaacttttt aaaacataga 


ttgggttttg 


atgatgataa 


tgaca cacat 


18540 


ggtatcatta 


tcccaggaac ttgataaaca 


ct acatt age 


tgagattagt 


ttattagggg 


18600 


tgggt gtttt 


ttccccactc ctcccctgcc 




tgt acaag tt 


cttctttctg 


18660 


ccatggagaa 


ctcacaagct gccaaaacac 


act egctett 


ccactgct cc 


ccgcacgcag 


18720 


cttgttttgt 


gcttgatgcc caagtggctt 




attttgeagg 


ccaactcatt 


18780 


tcagtttcct 




cttatHgZ 


aagtt ctgtt 


ttccctcctg 


18840 


t "t t g c t t *t t g 


Uttgtgtlt c^cttclgc 


cttttatctt 


tctccttccc 


tggctgtgct 


18900 


ccttaagtgg 


3-3-9 9 cttgtt ttctccttg t 


t cagcaccag 


caaactgggc 


aagatgggga 


18960 


ggcagggaaa 


gt ccatcacg taaatgtctg 




agtgagcaca 


aacaaggctg 


19020 


agtgacacag 


aggccaggaa aagggtttgg 


gctttgtagl 


ggacaatcta 


gaatacacaa 


19080 


attgaaggca 


atttgtcacc tggttgagga 




tctagagt ct 


agtagaacct 


19140 


ggtaaagttt 


gtcttccagg gaatcctccc 


aaca"t"tt"tag 


ttctaggagg 


ggacatggag 


19200 


gacagggaga 


aaagggttat tgtgtgcaca 


tatgtg'tg'tg 


tgtgtgtctg 


tgtgtgcaga 


19260 


tgtccatgtt 




"tgalzcttcag 


tgttgtgaaa 


taataatgac 


19320 


aataacttat 


attctttgca tagcaatttt 


cacccagaag 


taggecaaag 


agctttacca 


19380 


actgcacaca 


taggtgtcac tcacccacca 




gccacctgga 


999tgggaaa 


19440 


cagcagccat tctgagccaa cactacccaa cagtagacgt caatattaga aacaatcatt 19500

ttttgtgaga gttcaagcat gcgtgcatgt gtgtggtgtg tggtggcaag tggggaagat 19560

tattgatctg tagctttata aataccatgc aatacaaacc aacaagaaac tgttcccatt 19620
cctctagaat gcccctagca attcagcttt gcaaataacc actgactctg tgtagataac 19680


aatggaatac ctgggtgaat attttatttt caaaagcact aatattcaga ttgttgattc 19740

tatccatacc ttacccatac tggaagagaa ggctgttaaa gtatatgtga gtctggttac 19800

taccaattat ccactgtaat ggaggggaaa cagtagaaca tatcaggcaa agcagaaaat 19860

cactgaaggt cacttctctt ttatttttgg aaggaattat acatttttaa ctttcctaat 19920

tatgtttttt ctttggttag taataaatga atttgtattt cttgagctta cactgatgag 19980

agtagaaagc catgcaaaga aagggaaagg tagtccaggc aatgtggtcc agagactttc 20040


cagaaaacaa tggcagagca ttctgggatt tcttcaatat taaggataat cacagatgtg 20100

aatattgaca atgtatacac acacatatgt gcatgtgcat gggttcacaa tacacatata 20160

catatataca catatctata gcttgacatt gacatacaga tagacaagtg tgtctattta 20220

tttgcaaggc tgaaagaaat agatatttct ttatatatga atatacaatc caaactttta 20280

ttttggccag gattcaagaa atcactagag aaattgggga agagaactta gggtcttctc 20340

agaaatgaaa cctgcatcat ttatctggaa caagatatat gcatgtatct atggaccatg 20400


taatgcttgt tataatgaca tgaggctcta cttggtcatg gccacattca tctaggagaa 20460

aattcctaac tttagtaaaa tgtactcttt caaataataa agttatttta ttcaattttt 20520

tttttttgag acggaatttc actcttgtca cccaggctgg agtgcaatgg tgcaatctca 20580

gctcactgca acctccacct cctgggttca agagattctc ctgcctcagc ctcccaagaa 20640

gctgggatta caggaatgtg ccaccacgcc tggctaattt ttgtattttt tttagtagag 20700

acggggtttc accatgttgg cgaagcttgt cttgaactcc tgacctcaaa tgatctgcct 20760


gccttggcgt cccaaagtgc tgggattaca ggcatgagcc accgcgctca gccctcatat 20820

tttatttagt gatcataagt tcattttgca agcaaaaaca aaaaacaaac aacaacaaca 20880

acaacaaaaa aaaccaggag aaaaaaatgt gagcagaaaa tatcttgttt cctgaatatg 20940

gtataacgta atggtccatc aaagccacac ttggaggata gagctagatg gggtaaatcc 21000

tctgacttgc tctagaaggt gagtcatgcc aaagtggtgc ccactccttt gtatttctcc 21060

ttaggaatgg acacagtgct taactctcca caaatgactt ccacctgggt aagaggtaaa 21120


tgcttttcaa ttaccttgga acgaaagagg tagagggaaa tcatacaatt cagagatgtt 21180

ggcatggcga gagttcttct tctacagggg tgatgtatat gaaggatgaa accagggccg 21240

acctagttta actcctagag caagaatcta aacaaagttc tatgttctca cagagagcca 21300

acttaattcc ctcataatga catttagcca aacaaaaagc tcagctcatc ggggctacaa 21360

atcctttgag aaggacaagt ggacaaatgt gagagagctg ccagggatcg atgggccgca 21420




gttcactact 


gggtgctgat tttaatgtac 


aaactaataa 


ctcttagacc 


21480 


actaagtaca gcagattcag tgtcatttta gctttgaaga acagacgctc acagcttttc 21540

aagccggcag tgttaaatga tgtatctcat tccctccacc ccttgagtca actgctgcct 21600

agccagatta aggtgtcaga ttgatttgtt ttatacatct tttgaccatg ctcattgaat 21660
atttaggaag


tttcttcagc 


ccatattgag gctgagatgt


gcattaatca 


21720 


aagtcacaga 


Ittlacacac 


ctgtggaaac acagcctctt 


tattgtagcg 


attagttttt 


21780 


gcagtaacac 




tacagagctt tcctttatag 


aacaattgat 


ccttttcttg 


21840 


t a a. 9 c c a. ct a. 


cagaatgagg 


gaaatt a act ctttaaagtt 


t a a x a c t t x x 


"tctcccccag 


21900 


"t 9 "t 9 3. 3.t a. t c 


tagaaaagcg 


9999cttgct tttgctttta 


9ccggcgact 




21960 


aaattttagt 


tcactt ctcc 


tggagggaaa ccctgtt cct 


Lci yy L L y L L y 


ggctggtcat 


22020 




"t c a t g"t t "t g g 


9 9 3-9 1 ctgtt gtttttgt cc 


attctttctc 


tctggtattt 


22080 


ccatt ct cca 




ttaaat ctcc ctttatgt cc 


cattcgtaaa 


taatggcaag 


22140 




ttt t "t g t c ct 


ccccattagg tcatt cgtga 


cca"t"tc"taga 


aaaaaaatac 


22200 


rrttrtattt 


ttttcctcta 




gacaa"tg"tct 


tgtaacaatg 


22260 


c a 9 3.a 9 c c 3 


atctccatgt 


clllgclatt ttcattcccc 




get at cat 


22320 


tttgtaatgt tttgtttctt attctaaaag aattaaaaag gaacagtaag ccgtcacggg 22380

ggcctgtagt ccttatctca gtgtctggaa atttggacag tgtattttac tgctgagata 22440


aaatggaaag 




t cagcaaatc gtaatgggtt 


taagttctat 


tgaaategge 


22500 


aaccagaaga 


tcagat aatg 


ggggtCCttC agttgt Cttt 


ttaatcgggt 


tccccgcgag 


22560 


gctgaataga 


gacagagcag 




a"t"tct"tgga"t 


aggttaagta 


22620 


c a t 9 t "t"t 9 3- cL 


ctcttigcaag 




ct'taatcatt 


ttctggtcaa 


22680 


ttatctgtaa 


fin n c c c 1" 1" n c 
y y y L ^ ^ L L y v 


aactccatgg caattatgat 


gcaagttggc 


cttttgggag 


22740 




ctctctgctt 


ctgtttcctt gtgacttcca 


ttctctgcca 


taaattttxa 


22800 


ttratttatt 


at c t t "t g c "t a 


gt atagaaac aactttctgt 


gtagtaatta 


gagccccaat 


22860 


a. c a. ca ct t t a. 




<-y >- '-yyc'-y . *-yy a L y LLLL 


catggcctgt 


gtttgataag 


22920 


"t 9 c "t C t t t 9 "t 


tgatttttga 


tgaatg taca t cttttt ctg 




gaaggggatg 


22980 


cctgtgatga 


caaaaggcag 


ggggttgtct gtcagcccgc 


c*tga*ta*taga 


gctatggatt 


23040 


n an ii g g 




gt t gag act c atctgtcctt 


tacgtgagca 


gaggactgtc 


23100 


a. a. t a a. 9 9 3.t 9 


g t a t c at "t t g 


cagtgcatcc agaaagacat 


cttcatttca 


aaggtcatca 


23160 


ggaaaccttg gtaaacaaag ttttaaggcc taaccatgtt atagtaactt ggcatttaaa 23220

aaaatgtaat aaagctcctg tctatgccat ctgtgtactg tgtcctaacc atgcctccca 23280


aatggcagag 


ataccaaggg 


3-gggggacat gggt cttat c 


caatgctggc 


ttcaggaagc 


23340 


sggtgaacag 


gcaccaggag 


ctgaccagac ctcaccagac 


atgaatgccg 


tgggcaaaca 


23400 


ttaagtggaa 


tcacag"t"tgg 


3-tggacatgg gaatcactca 


ttgccaaaaa 


aataagcaaa 


23460 


tgccaactcc 


tLLv-d LLL Ly 


tgggaaggcc atttgt ctgc 


attaaaaaaa 


gctgtaatgc 


23520 


ggtgatacaa atcctcactt aaaaaaaaaa agtatatcaa actagtggta gagtcatgtg 23580

gcacatcacc tctggtacat gggagtaaca acacttccag gattctatgg cttcaatgaa 23640

tgtccataag aagtatataa atgcaagttg ttctactgaa agatgaagaa caatggttaa 23700
aaataaagat


gttcggctta aggaaagtct 


g a ti "t tia g aati

n+-n a r 1-1-1-1- r 
y Ly dL LLL Lt_ 


cactitigaaag 


23760 


gtagagggtt 


gtgatatgat ttccattact 


gacaggfttt 


fata P1-1-1-/-1- 
La Ldd L L L(_ L 


tgtaag t atia 


23820 




tgcctctctt gccaccattt 


tggtiggag"t"t 


aaatacgtat 


ctttccaagti 


23880 


aaagaaggga 


cgggaacatit ciaaaatigctiti 


cagacacttia 


aaaaaatiaaa 


tgaagaaaati 


23940 


ggcaatgttc 


"t"tcL"tCC"t"t"t"t C 3.3. 03.11111! 3. 3. 


a "t t "t a a c a g "t 


tcaacagatg 


c a ti "t a c c "t c "t 




cagctcatca 


3-gtggtittag caatttccgt 


gagttrttact 


acattcagat 


ggagaagtac 


24060 


gcacgtcaag 


ccatcaacga tggggtcacc 


agtactgaag 






24120 


tgtgagctgt 


acagggctct gaacatgcac 




caaatgactt 


tgaggtagga 


24180 


aCTaanCTlTm 




ttcctttttt 


aaaaaattita 


ttttctttag 


24240 


aaatgtaccc 


aaatctgttiti ttig'tgt'tggt 


ttcgcataca 


agcatccccc 




24300 


caggtagagc 


tigtgatigagg agcttccata 


gtccccattg 


gaatcatgag 


gctctglccc 


24360 


actgccattt 


"t"t"tccccati"t ccctggcttt 


t c a g c X. t g t g 




atttggccac 


24420 


agaaaaggga 


ac~tg~tagaati ccaaagaaaa 


atggcagcaa 


gcagcaalgl 


cagagtgatt 


24480 


cattttccaa 


ggaagaggtc cctactccaa 


tagacctttt 


t c a t a "t "t "t a g 


gttctgagag 


24540 


gtcaatgagc 


tgatacatgc tatgtgcaat 


ggtiagctacc 


aatgttattt 


tcttaaaaag 


24600 


tctagaaacg 


"ttgatggggg agtigatcatg 


g "t "t "t c "t g a c "t 


t"tgaca"t"tta 


gtccctttgt 


24660 


ggaggaaatg 


gtatgataat ttactaagta 


catagcataa 


gag at ccatt 


gacatctttt 


24720 


tttgggattt 


"tg"tttc"tgtli tttgttcttt 


"ttiggaggaga 


gactcgtgtg 


ttttgcctaa 


24780 


gtgtaccttc 


acaagcatgc tgctctiiligt 






ttatatatat 


24840 


ctgtgacgtg 


"tatattctag atccacacaa 


agcagcatag 


agaatitccca 


gaaagcaata 


24900 


tccatgcaac 


aatigaaagati gtigtiggctati 


gagtaaggca 


tttctttatg 


ggctaatgtg 


24960 


gtgcctcagc 


aaacag*t"t"t"t ca~tcacaacg 


tgatgactct 


ctgtgagaca 


acactagcaa 


25020 


atctcccagt 


actcacaaag gca"t"tttgct 




ggctgaggca 


acagtagttg 


25080 


gaggtgggaa 


catggcaaga attctgcagg 


rtgZctccc 


"tgatigatigag 


atcagacagg 


25140 


ctgtggcttg 


acaaagttigg "tcca*ttitc"tti 


g tattatctt 


ggctagatigc 


tgtgccatct 


25200 


tgagggtagg 


aa"t"tt!ti"t"tc"t ccaacgtctg 


tgtgcacttg 


gaccttatigt 


taatattctt 


25260 


gctttcttct 


tigtiagatiagg "tatccaggaa 




gttccaaatt 


tcaaaggaaa 


25320 


gaggacacct 


tiggcctcgct ctig"tcaa"t"ta 


3-99 99^- *-"tg a 


cccctagtac 


tcttcctgct 


25380 


tgcccccctc 


cttttttitcg gctcttigtcc 




"tggcaatgca 


gaccagttat 


25440 


agtggcttat 


aaagaattga atatiggaagc 


tcagcaatgg 


ggaagtcata 


gtttttcttt 


25500 


gaaagtttga 


gtagttatag "tgtaagctac 


ctati"ttgtict 


ttgctctcta 


agactaatat 


25560 


attttttgcc aaatgtgtga taaatgaagt ttgggtggtg tgtgtgtgtg tgtgtgtgtg 25620

tgtgtgtttg ctaaatacat taaaagtgag aattcttcgt gtactgctcc actattttaa 25680

aatctgtttt taaagtctca gttgtaatag agcactggct cactataatg acagagcact 25740
agcaggcttc


ttctaaagct gaagaatatg 


attatgg eta

accattttaa 


agaaatctca 


25800 


ttaaga.gcat 


cttttctccc ctgcctttct 


gctaagcctg 




ccttaagcta 


25860 


ag a. g 3. c "t "t c t 


gtgtgctagt gaattattta 


cattacatga 


tgacataagt 


atctgtttgg 


25920 


c a g c a. "t a. c a. t 


caagcttcat gaaagaattg 


cccaagattc 


atgagatgac 


ttctgcattt 


25980 


ttgcta"ta.ta 




ccttaaagtg 


cgcacgaggg 


ttttcgggtt 


26040 


gctitiaaacct 


tacctggttg gaatttaatc 


cgctacccac 


aggccagggg 


ccaaaatgac 


26100 


acaaacaggg gatggctggc atcaggaggt acccgacaag ctgctccatt tagcatcatc 26160

taaatcctct ttaatatgat taacatctaa tatttctctc tttgtgaatc atatccactt 26220


ccagcca.ggc 


cacctctcct ttatctgcag 


tgtctatttt 


aagactgctt 


cactgcaagg 


26280 


ay La iyyyy \~ 


ccgggcagga attttgtcac 


ttctcatgtg 


acttcggaca 


gttattggac 


26340 


ta~t~tctggat 


ctgattcctc cttcagtgaa 


aagaagggaa 


gaaagcagga 


ccatgcagtg 


26400 


tgtcctgccc cctctactca cacacttaca catccatatg cacacacgcg taccgaccac 26460


cacaca."taat 


cctaatatca cgaaatcgtt 


tttcttttag 


cctctcggtc 


tggctcattt 


26520 




gtttcagata aggtgagccc 


ttcttttccg 


tgcctttgtg 


catggaggtc 


26580 


a. c "t g c t "t a. a. g 


tgagatgctt aaaaagccac 


cgttcttatc 


gtggtagctt 


tgctagtgtg 


26640 


ggccgtggct gagagccaaa agtagatccg gcaccttcag ctgaatacct ccactgatac 26700




gctttacttt tgtatttaag 


tttctcctct 


taaggtcaag 


taaaatgaac 


26760 


cta."ta.g"t"tta. 


agtattagca agtgaagagg 


atggcaaaat 


ggagaactgt 


gctacaaaca 


26820 


gagct aaa.cc 


atggtagagg gactttgaag 


ctacgtctac 


acggtgcccc 


aagatccagt 


26880 


cga."t t c ca.a.g 


gaatcgtgtc acccagctta 


gtaggagctg 


gtcaaacaat 


aaaatgtctt 


26940 


a. t "t g a. t "C g "t a. 


ttcccagact tctcaatcaa 


ttgttgggaa 


caataataaa 


atagctaaca 


27000 


tttattgact gtttactaat gacctaggca ctcttctaag tgttttacca aaatagggct 27060

tatttaatgt gggtaataat aatgacagtg ataccaatat aataacaaga aaaacttcag 27120


tt"tgccca.aa 


gctttactat tcttcaagtt 


attctaactg 


ggcagaggca 


gatcgagcca 


27180 


999^-9 3.gcig a 


aggaggtttg acgtctcttc 


actactactt 


tattccttct 


ttctctcctc 


27240 


t a. c c c c "t "t g "t 


cttctctcag ccttctactc 


ccatctctgc 


ctctgtcaga 


agcttgctag 


27300 




gtcactgctt agcaccacct 


ccgtccagcc 


cctgctgctg 


atggctctca 


27360 


a.ggct gga.ga 


ggctgctgac ccctggccta 


caggaaaata 


aagcagatgg 


ggaaagttta 


27420 


tcagca.gcga 


agagggagtg gcttgcctgc 


tctcctctcc 


tagaccctgc 


atttcctggc 


27480 


c 1 1 "t a t g a.g t 


acaggacctt ctaagtggca 


gtagagcttg 


ttctgccttt 


tgtatcagtt 


27540 




ccagaattct tggcacggtg 


tgcagactta 


oaataataaa 
y yy ^yy L M°y 


cgtttgagaa 


27600 


gacccaaggg atgtggaaga agacacccaa ggggaaaaat acgaaataca cttttagttt 27660

gtgctaaagg gcagaagctt ggccatatca caccgggtgg ggtgtcttgc ttctgtgcgt 27720

gagtgtgtga ggcacgcagg agaggggtgt gtaattatgt gctgtatcct tcatttctgc 27780
t C C t C3 C 3. tt


t aatgag3tt 


ggC33C33tl3 33tttgtCtt


atggtatata 


27840 


tttctatgct 


tc3ttctcac 


ttc3Ctttg3 sgggcttcca 


aaaaaaattt 


tatgggcaga 


27900 




ttgggatt cc 




ctgatacttg 




27960 


yy *-y La ■-y^y 


ttggstttts 


tegcttttgt tgttttcctc 


acaactgtgg 


caggaaaaga 


28020 


agatgacgat 


ctctg tc3g t 


"t"tc"tg39gc"t ggtttacctg 


ttttgcaaag 




28080 


agacaactaa 


C"t"tg tgt3ac 




aegtaaggag 


ccaaaagaca 


28140 




atgtgcsgct 


gcglatggca g^tcltcca 


aaggttggag 


ggtttgttac 


28200 


09 cctgagtg 


"tag 9 ttg ag 3 


333933'tg'tg CCagattCCt 


tcatccagtc 


acattgagct 


28260 


ct ctttctca 


ttccaggg t3 


*-*-yyyo-yy Ld - y L y L L 


cgccatggta 


agccacacat 


28320 


ccctcctggg 


cccct C3gtg 


gct3gtcatt cacctgtagg 


cagggtctaa 


gtttccagta 


28380 


39 aatgacsg 


3tCt CCCCt3 


tcctcgctaa aggeccagg t 


L yyyy aL yy 


aaggcttcaa 


28440 


aataaattga 


atagggaact 


tig at to a etc attagtgg cc 


ttatgaatgc 


cattttctaa 


28500 


ggtactaata 


cctcactggg 




rl~n1~nriri1~1~1~ 

L L y L yyy L L L 


gacatttttc 


28560 


tgggtg3C3C 


3tgacaggg3 


Iglagggtac ttccgclca'c 


ctttgaatgt 


gttttcttac 


28620 


ttt cctcttg 


gaaat agaaa 


ataaaaaaca acaccccacc 


ccacccccaa 


cacacacaca 


28680 






ctgaatatgt tct ct acccc 


atacctaccc 


ttttcttaac 


28740 




ttt C 33 tag 3 


acccacattt cagaagattt 


aatatatttg 


gaagactttt 


28800 


attcgcattg 


tC3tCtCttt 


aaagaaaaat gaggacagg t 


ggatttagga 


agcgcttccc 


28860 


tctgctccaa 


3t3g3tCCt t 


aaatatgagt gategtttag 


aaaactggca 


catgagtgag 


28920 


agcctttcac 


tgctgttgca 


gtcttttggc ctcaaagctg 


ctgagccgtt 


taaataatcg 


28980 




tcttggtggg 


tgg cgaggag 9333393330 


ccttaccatt 


tcttcccttg 


29040 


ccagtcccac 


cgttgacaag 


CC333ttg3t Ctttt33Q3Q 


atcaaatgaa 


tgttctctaa 


29100 


atatatgtac 


acacatggct 


gectggaaac gtatt cctt c 


cacagaatga 


ttgcctgaaa 


29160 


ttt 93.3.QQ3.Q 


agcgcagtaa 


agacaccagg ttggaagtgg 


ggttgaaggg 


ctagggggtg 


29220 


gagtggaggt 




9*-9tg catga ggcttcactt 


ttgtacactg 


tccttttggg 


29280 


3"t "t C339 9 tg 


tt^tcagta 




tttatcatct 


atttggtaat 


29340 


9 t C 3 t "t 9 C 3"t 


ttttsgctcc 


ctgtgtcttt tttgtcattg 


ggttacattc 


aagcacagta 


29400 


agatcaactt 


1 3333 C C"tC C 


ttactcaacs gctttattsg 


ttatagcatt 


ccatgacctt 


29460 


"tC"tC33C3"t"t 


cttaaagaaa 


33g3t3cagt g taatgtcgc 


tttactttgc 


ttattgtcct 


29520 


ttgttggggt 


gaacaaagca 


ttttctacag tggctatatc 


acataattat 


acagctttca 


29580 


at3gc3gtgt 


cttggcac3t 


atC33agttc agaggagect 


ttagaaaaaa 


aaaaagatgt 


29640 


tttgtggcsg 


cctagggagg 


gtctcatctt tccttcagaa 


aatagttcaa 


ggctcttctg 


29700 


tcsagcttcc 


ctacttagag 


ctttttctcc tcctgcttca 


taaagtttaa 


aggggattca 


29760 


gtggagttct 


atgatctatt 


tcctttgaaa gattgttcct 


eggcacagag 


aggecctttg 


29820 
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39467A.tXt.txt 



acttcaagag 


ttcacagatt 


catgtcttta ggtatcatat gtctgacctt 


atcagttact 


29880 


ccatttaatg 


taggagaaaa 


agtctcaact 


ctttgtgttt gtctgttttg 


cctctgtgaa 


29940 


atgatttggt 


gaaaagacca 


tcctttttaa 


cacaccactg 


agaggccgtt 


tctgactgta 


30000 


acctaccctg 


tggcttttct 


ctctttaaaa aaaaaaaaaa tcgtccttgt 


gttttgtgta 


30060 


tggatgagtt 


cacagtgaga 


atagaattat 


acaagggcag 


gcgcacacac 


aaaaaaatct 


30120 


ttgctttcct 


ccctcacctc 


ccgcaccccc 


ccacaaatga tctattggct 


ctctcggcgg 


30180 


ctgtacccca 


acaggcgaag 


ccatttagca aacacagagg tagcggctgt 


ggtgctggga 


30240 


cagtggtggg 


ttttcccttg 


cttcgaccta cccctaaggc cttcataatt 


aattgtcctt 


30300 


cagcgatgag 


gaaagttcag 


aaacagtgtg 


tggagtgatg 


cctattgtct 


gatattcagt 


30360 


tctccttgcc 


ttggttcttt 


ttcttcatcc 


cacaaagggt tatcaatggg 


agaaagagag 


30420 


caagttctct 


tctgagagct 


gctggtggtg gctgtagctt tcagtgggat 


gttatcattg 


30480 


tgttcagccc 


atcctggatt 


aaatgtctga agaagttcta acaacctttt 


gaaagacagc 


30540 


ctgtttattt 


cgcctagatg 


aaacaaattc atttagcaaa ccaaagcttg 


ttcgaagttg 


30600 


gccacccctt 


ttcacatggc 


agataacatt 


atagatcaaa tttcttcatt 


tttccccccg 


30660 


caggatgtta 


tttaacttga 


actgtttggt 


tctttgtcag 


tcacagggca 


gaaattttaa 


30720 


tgactattca 


ctcactgctc 


ttaaatacat 


caatattaat 


ttacaataat 


acagtttttg 


30780 


ctaacatcct 


ttttgatgaa 


gcgtagacgt 


ttaatacttg 


aaagcagata 


attagtttaa 


30840 


aaatattgtt 


tctccttcaa 


tgactgcctt 


cagccaatct 


tcaattctat 


cttgtaagat 


30900 


gatgtgaaac 


aaacgcattt 


tgtcttcctg 


caccccccaa 


tttttggctg 


agatacaaaa 


30960 


taaagatgca 


gtgtggagag 


agctatttga 


gaagggtagg 


aaaaagagaa 


ccgtctatta 


31020 


atgatcatta 


tactactgtt 


cctgttaaat 


agggtgaagc 


caagaaaaac 


aaatataatc 


31080 


gttcttccga 


ggagagcagt 


tgaactagta 


aatcacagag 


gtttaaaata 


actacattgt 


31140 


agtgttcatg 


acaacttcaa 


ggctgaaggg 


aaccatattt 


aaaggcaatc 


tctgtgtctc 


31200 


ttatagcagt 


ttcttttgga 


ggaagagacc 


gacaggatgg 


ccagaatcaa 


ttctgccccc 


31260 


tttgctcttt 


gaaaacaatt 


tcacaacaga 


ccttttggta 


tttaaagaga 


acctgtatat 


31320 


ggaagttgac 


acaactaata 


tagtcatacc 


aaaaaggggg 


tcataaaaaa 


ttaaagttct 


31380 


tcttatgaat 


ctttcatgag 


aagcaatgaa 


aagggacact 


agtgtagcca 


agttctttgt 


31440 


gctacaagct 


cttcttccgg 


gctctgagct attgttcttt 


cagctcctca 


aacagacttt 


31500 


cactttcaaa 


ctgacaaaag 


tcacttaaaa gccagacagc tgtactaaca 


cacccacctt 


31560 


actgagcaag 


agccactggc 


aggtgacaag gcctgctgag agaccttgtt 


gaaaatgagc 


31620 


aggggtgact 


ttctcgtgcc 


ttaacgttgc ttttgcactc actttgagat 


ggcccattga 


31680 


ctgctctttt 


tgccccccca 


ccccaaaaca ggctccccaa aatatgttgt 


gcattttctt 


31740 


tgcagtgtgc 


aacattgaca 


tccgtgatca tatttctgcc ttacacctgt 


gtggctaggc 


31800 


acgggttctg 


ggaaatttgt 


gcccttctag 


cagaagacag ggagtttgac 


tcacaaaact 


31860 
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cctgctgcct 
atttctgttg 
accttcggat 
ccctcagggc 
tcccctcccc 
tattttgctc 
tggcaaatgt 
gcctgtgccc 
attcatcttc 
tcttgcccaa 
ccttccccac 
atagctcgcc 
attgcttcca 
cttattatct 
atcctttatt 
tgtgagtaca 
ctttctctaa 
tctgccccag 
gttttggaaa 
ccacagcttt 
atatcttggg 
attaattgat 
taaattctct 
taagatttta 
gttagatcac 
aagaaaactt 
tacctatcaa 
acaaggtgag 
ctgtgcaggg 
tcgagatata 
catgagtttt 
aagataataa 
ctaaacgtga 
tgaagagggt 



cttttccttt tgcccctcca 
cctcactagg gttggataga 
ttaagatcta ggcagagatc 
agcagccgtc gtgggccatg 
tgtcaccttg tgtacagtct 
tgggggccct ggcacatctg 
cccacgccgc aaatgcttca 
aaatcccggc ttctttttgc 
ctggttcttc cccattttcc 
atcactccga agtctacaat 
tcctgaccac ctttcctact 
accatgaaga ctaacgtttt 
gtttatgatg agagacaggg 
gtgtgccttc ctcctccgtc 
tgcacctgtg cattgccaca 
gaaaatactc ttcatgcctc 
gtgaatcttc atctccactc 
ccctcaacaa tgacctacta 
cagctaaaac atttctctct 
tgtgaaggtg tactggtttc 
ctgtgatgga tactacataa 
ctgtgttagc atattatgag 
aactactgca acataaaatg 
gttttcaggt gtagttaaag 
tgccatgaat tgtaatttga 
cctcagagga atgagttgcc 
agaggggcac aagaatatgc 
aagagaggtg gtcagatcag 
agcatattgg gagaggctgg 
tccaaaatat tcctaacctc 
acttaaacct tttttaaact 
ttctctaagt gtattacctg 
ttcccctaga actccacagg 
tgtgtccata ttctccattt 



ttcagttcaa atctcactta 

aaacacccac caaagatggg 

gttaggtggg tagtcctgcc 

ggaggcctcc ctgtgtgcgc 

ggtctgtgac actgatggtg 

cagagcccaa gcacatcttc 

ttagccctgc tgccggcctc 

tccgttcttt tgtgtagctg 

tcgacttctg aactccagat 

gcgaaatgaa gtgactcttt 

ttttttcccc caaatgaata 

caaacttgca atctgaaaag 

ttagaatgag tttggcatta 

cccactctgc ccccctcact 

ttttaccaat tttctgaaag 

gctgtgcacg ttacagtctt 

taccctctcc caaaaccact 

ttagatactt acagtgatta 

ctaaagtttt attctatata 

tacattagct gcagtaaatt 

ttggtatgtt taattttccc 

cagcttttcc aatagagttt 

atttaaatgt ctccatcttt 

gagttaagtg tatattttat 

aattcaagac aaagactctg 

acattgtacc gggttgctga 

atgttgcaaa tattaggacc 

gcgggtgggc tccccaaccc 

tacctgtcat tgaatcattt 

ccagttgccc accattatgg 

taatctcatt gtcagaatat 

ctgggaaaat actatcttct 

gatagccctt gttataatat 

cctttctgat tttacagact 



aggttttcag 


31920 


tgcaaacctc 


31980 


tgcatcccga 


32040 


attacaggcc 


32100 


attatgtcat 


32160 


tttgttgcgt 


32220 


cttgccagac 


32280 


atgatcatgt 


32340 


gtcccagttt 


32400 


acccttgaat 


32460 


gtgactttga 


32520 


acaccaagtg 


32580 


ttagatattg 


32640 


atttccttgg 


32700 


cactttgaaa 


32760 


ctgaaggttc 


32820 


gccccctcct 


32880 


acacttggct 


32940 


tctaacagag 


33000 


ttagagctta 


33060 


ttaaatttga 


33120 


aactagtttt 


33180 


gagcaaacca 


33240 


ggaaatcatg 


33300 


ttaagggtta 


33360 


gattttcaaa 


33420 


aatgtagcca 


33480 


attgtcagcc 


33540 


ttcaaaaggc 


33600 


ttttatcacc 


33660 


accactcctt 


33720 


ttttacggct 


33780 


cctgggattg 


33840 


ttgatcatta 


33900 
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ctccctctta 
ctcatacagt 
gcctgactgt 
gtgtgggcag 
ttgattaccc 
cttattggtt 
aaatatgatt 
ccctccagat 
agatagcctt 
tttctgcttt 
aatattttat 
ccactgtcag 
aggccctggg 
ttaccctgcc 
tcacagagtg 
tgtataagca 
acatctgtgg 
cgggcaatga 
acaagagcaa 
tccaggactc 
tggaaggagc 
tgtgctcacc 
ttttgtgact 
aatagagggg 
aagctttaag 
ttctaccctg 
ttttcgtttt 
atggcaattc 
gactgaattg 
ggccaaacaa 
tccatgcagg 
gtacgtctcc 
tttatttttt 
tttcagctca 



atcttcatct ctccagatta 
aagtgggctg ccctggatca 
gtttttctga gggacagtta 
tcatgccacg gtttggtgat 
caaaagttta tcctcagaag 
aaaatgatcc tatgagacca 
gtgtctatag gacataccca 
gttttcagtg ttcttgtaga 
agtcttgaac ttgagtctat 
atttgatgtt attctcaggc 
ataaaacatc cccttgaaat 
tggtcctgga tcaataccgt 
atcaaatccc agctctgctg 
tgtacttcag tttccttatc 
ttgtgagaat taaatgaatc 
ttcacattta ttcatttgca 
tccaattatt gcaccatttc 
gggagctctc atcagaattg 
gtgttcctgc tctgaatgtg 
tggggtcatc cgttccagat 
actggcctgg agggtctggc 
ttttggaaag tgataagatt 
ctagagtaga ggggcagttt 
tattctgcat tctctaaata 
cagtgagtat agaggccctg 
gtgactctgc tgcagcttaa 
ctcccaaggc cctatttgca 
ccttactggg attatagaga 
gcactatcat tagacttgct 
tgaggaaatt tgtcagtgaa 
cctcaacaca gactatcagc 
tctctttcag agcagaattg 
tgagacacag agtgttgcac 
ccacaacctc cacctctcgg 



aggagctcta atccttttta 

ttttagctgc cctgctgtaa 

cagttactaa ctcacacagc 

ggtgccttgt gcacacccaa 

ctggaattct tgagttggat 

gctgatcaga ctcttggcaa 

gccaaataga aaataggcag 

tcaagcactg gggtatttga 

aataatgaca gctctggggg 

aggcaatgaa atgttcacct 

cttacaaaga aaatgctttg 

tgtaggactt acagcatgga 

ctttctagca gtgaaaccct 

tgtaaaatag gggatgtaat 

tacacaattg tattagcaca 

gagccaagta aatgttacct 

ctgctgaccc taaataggaa 

gaacatatat tcaacgtaaa 

gctgaaaagg cgacactagc 

gagaaggaca cgatgagatg 

tctggccata cctgcctcat 

aaattcaaga gtttcattct 

cattctagct ctgaaatttt 

aagtctcttt tgagtcttgg 

ggaatccaga tggcttccat 

ttatctcagt caaaatctcc 

gatctgaatc tcaacagtgc 

ctaatttttc aaattcatac 

gctcacttta tttgttgcct 

gccctcatgc cattgggttt 

atttataata tgcattaact 

gctatgtttt tttttttatt 

tgttgcctaa gctggagtac 

gctccagcga ttctcctgcc 



aaagcctaat 


33960 


tgcgcttcca 


34020 


agaactccag 


34080 


tgggactttt 


34140 


ctcagtagtg 


34200 


atactctggc 


34260 


atccaccctg 


34320 


catcatgagg 


34380 


aaagctccag 


34440 


gcaagtagtc 


34500 


gggagtcttt 


34560 


ctctccagcc 


34620 


ggcaagtgtc 


34680 


agtgactact 


34740 


aagtaagtgc 


34800 


tgttgctgtg 


34860 


agtaaacaaa 


34920 


actggttttc 


34980 


ctggaacagc 


35040 


ctgggggtgg 


35100 


tgtggtctac 


35160 


agctctgaaa 


35220 


gtgactctag 


35280 


tcatgttgca 


35340 


gtgaggcccc 


35400 


agggtgccca 


35460 


ccttggagac 


35520 


acaatttatt 


35580 


tggccagggt 


35640 


tctcacacat 


35700 


tctatataat 


35760 


cttttatttt 


35820 


agtggcatga 


35880 


tcagcctccc 


35940 
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aagtagctgg 
atataactac 
ttctttaaat 
tgttttgtat 
gatatgtatt 
cattaacatt 
tccctgagaa 
agaaatgatg 
ctgtgtttcc 
aggcgtacaa 
gtattgacag 
agtctagtgg 
tatttacctg 
aaatacagcc 
ctggaaatac 
atgggattga 
gtcataagct 
ctgtctgatt 
ccctttctca 
actcacattg 
ggcaaagctt 
aacgaaacag 
gtgtttgtgc 
ctgcttcttt 
gcttgtcttg 
ttgttaacac 
tgagctgatt 
aatatgtact 
aagtcttcct 
aggcagcctt 
aatcgaacta 
gaccattagg 
aaagtatggc 
tgacttaaat 



gattacaggt gtgcatcact 
cttccctact aagcctactt 
attccccaaa tgatagtaat 
gtgtgagagt ttgtggttgg 
cttcatattt cttatggcta 
ttaaattaga atatggtgct 
caagtaaata cttgggcagg 
ggaagctgac catatgtaga 
tttcatttag agcatccact 
gataatctgt gtagatacct 
catcctaggt tgtcccaccc 
acttccaaaa gcccccgagt 
tcttgaaaaa gggattccaa 
catccttagt tttgcaaagg 
agaatagaat ttctctgaca 
ggccaataat ccccatcctt 
attgccagca gagtgccagc 
tcctaccatc tgtatcacaa 
tcacaacctg cctgaacccc 
ctccatcaat agcaatacag 
aattatgaag gtaaagctgt 
gagtaactgt ggattatctg 
tatgttgttt ttggataatg 
taccacctca cccagtaatt 
aactcagaca ccaatggcac 
tacaaagtga ctggtatatg 
tgtcagatgt gtcatgaagc 
ggcaggaaag ctcccaatta 
ggtccttgac caccacgtgt 
aactccatcg ggggaatggt 
tcctccaaga tagagcagaa 
cctctagaaa agctgttagc 
ctatgggcag agctgcccct 
ttttaaacag ttgtaaaaaa 
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.36060 


cccttgagat 


36120 
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36240 
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36360 
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36420 


agtgacctga 


36480 


ccagtcctag 


36540 


attgggtagg 


36600 


ctttgcaatc 


36660 


gtagcatttg 


36720 


atagaattcc 


36780 


aaaccaaggg 


36840 


tatttggggt 


36900 


ctctgaagct 


36960 


tatgagcaat 


37020 


taggctcctt 


37080 


tattagccta 


37140 


tacattctcg 


37200 


caatattgga 


37260 


aagcttggcc 


37320 


agggctgtca 


37380 


aaaggggaat 


37440 


acctaagtat 


37500 


agcacattta 


37560 


ggtaagactg 


37620 


ccgtagacat 


37680 


caagtgaatc 


37740 


cagaatgaaa 


37800 


aggggctggc 


37860 


aagctaagaa 


37920 


gaccaaatat 


37980 
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ggcccacaga 


gcctatgtat 


ttattacctg gccctttact 


ggcaaatttt 


gctgaccacc 


38040 


ggctgaaggt tttttctctt ctgtgggaca tgaactctct gagattcctt ctagttctga 38100 

agttccaaaa ttctgtgatt cctttttttt tttttttttg agatggagtc tcactctgtc 38160 

acccaggctg gagtgcagtg gcatgatctc agctcactgc aacctccgcc tcttgggttc 38220 

aagcaattct ctgcctcagc ctcctgaata gctgggattg cgggcgccag ccaccacgcc 38280 

cggctaattt ttttgtattt ctagtagaga cggggtttca ccatcttggc caggttggta 38340 

ttgaactcct gacctcatga ttcacccgcc tcagcctccc aaagtgctgg gattacaggc 38400 

gtgagccacc gcacccggcc aattccatga gtctttgatg gaatagtctt ggtccagctc 38460 

ttacctgaac agcctaccag atgagcaatt tctgcacagt gcttccagtt gtttttaaga 38520 

tcttaacagt atctgtgtag tatctcaggg ggagagaatg aggtattagg ttttagtttt 38580 

tgatgctttt tccttgattt tgcttgcata tttgtttgtt tgtttaaact tggaatcact 38640 

ttttaagacc tatgcagagt ttgggagaga aggaaaattt gcttcatcgc gaccaataat 38700 

gtgacaatta tgtttcctaa cacgtataat accaagacct ccatgtgtga gcaaataaac 38760 

tagccactta aagcacgttc actgaccaaa tttcagcccc acgaaataat tttgacagtc 38820 

tctcatagac atttgtcatt ctgctcctag caagctagta ctatcttcta ctggggctat 38880 

ggaagagatg gttttactta ccttgatctc tacatgcaga attgccaatg gaatacttac 38940 

ataatttaaa atgtatgcac aatttattaa acgtagaata gaagatgtta agacatcctt 39000 

ttctattacc tgaaagtcac aattattcga aatgctcaaa tctagaacat tgttgataat 39060 

tatataatat tttaacaaca catatgttat caacatcata atgctgtaga aattttattg 39120 

tgaattttgt attttctaaa tactcttaaa agacaaagac tcaaattcag gtagaaaaac 39180 

aaagaagata ctcagggtgt atctctgccc ttcattcatt gctgtggtca gagaagtctg 39240 

tgtgaggggt ttggccggta gcagcccccc agatccgtac actgcagacc aaaattcagc 39300 

tcctgtgatg cttttccatg gagtttccct gtcaattcaa ggtagatcct caacctccct 39360 

ccttggcagt ttgcatgtga ctgttcattc tttttattac atttcctcca gggggccatt 39420 

ttcaccatgt catatctgtt tgctatcagc atttataagg gctggtgtgg cattggagga 39480 

tgtcaagtgg tctgacttgg aagtgtactg ccacaaactc catgtaggtg acaggaggag 39540 

agacctgctt tcccgttgcc actttttgga ttatccctgc aactctttcc gtctggctga 39600 

caaaaacctt ggggctattg ggtggctcat cacttctgct ccttctctag cctttccctg 39660 

ggtttgcttc ccccaacccc cacaccccct cgcacattaa catgacattg cctggtgagc 39720 

acagaagaga gcagcttcca ccagctgaaa cctctgatct caaactcact agagagtttg 39780 

gcttcgggat tttggcaaga aggccgattg cccatcaggt cagcatgaat aaagatttct 39840 

ttcttccctt cttttttaaa gtcaagcatc aaccgaaact gctcccaaag ctctgtctct 39900 

caagacaatt taaccccttt cacctaagta cattttctat tttgaatgca tggtactttg 39960 

ttttattctt ttcctgtgag atgaccaaga aatctactat atgtaaaatt tgaaagccaa 40020 

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gtcaattcta 


aaccaggctt 


atcattttta 


aagtatgttt 


atccagcttt 


gtagtaggaa 


40080 


caagcagact 


gtttgaaggc 


cacatacttt tcaaaccctg 


gttgcaacac 


gtctgccccg 


40140 


ttttgaaact 


gtctttatct 


agccgagaaa acgaaaatct 


atttgacaaa 


gtggcactct 


40200 


ggccagttta 


tcttgcaata 


tggctttagc tcactgagtc 


tattgatttc 


cttaaattaa 


40260 


tgtttacaga 


atgctactga 


attttgctca acagaacatt 


gttctttcga 


agctttatat 


40320 


atatatatat 


ataaaagaga 


tacagactgt tattgccatg 


tgttcctttg 


tttagaccaa 


40380 


ggaaacatag 


tttttaggtt 


tttttttttc ttaagacagc 


cttgaactat 


agccacttcc 


40440 


tacaagcatt 


tacttttcac 


atatttaaac agcaaaacat 


gtaactagaa 


agtgggccca 


40500 


aactgcatgg 


gtattagacg 


aatctaatcc tcagtgttcc 


tgaaagctga 


atgccacctg 


40560 


gagcatcaga 


gggagaaagc 


ctttagtcct aagcccagat 


gttgctggag 


aaccttcctc 


40620 


tgcctcattt 


ggggtaactc 


ggcaggcacc 


cgaaagcaac 


ttcacagcca 


gtgctcctgg 


40680 


atcctgctag 


tttttccaaa 


cacaagcatc 


ctaataaaat 


tcaaacacca 


tttagctgtt 


40740 


tgggaactct 


aaatataaca 


tcttgccctt tgaccacggt 


gctcagtgtt 


caatacacaa 


40800 


aacctaatct 


ctaaagatga 


ttttaaaact 


gaccttccca 


gagaagtaca 


cgtatccatt 


40860 


cagctacgaa 


cagtgcagaa 


aacaggattt tgactcataa 


ttatgaaatg 


gccaaaataa 


40920 


aacttaggga 


acacaaagca 


acttttctca accggttgac 


tcagccaaca 


aactcaccca 


40980 


agcgaacctc 


ctcagagcac 


ctctcaaaac 


gatgctttgc 


agacatttat 


taatcacagt 


41040 


gaatgcttcc 


caggaattag 


ggctcctctt 


taaaatctca 


aacttgtaaa 


ccaccttata 


41100 


tttggatgat 


attttatgct 


tcccaaagtg 


cattcatgtt 


ttcttttcca 


tttgatcctc 


41160 


ccctggaatg 


agagggcact 


ggaatagaat 


ctcaggattc 


actgtgtata 


gcatcctgca 


41220 


ccattccttc 


tcttctggag 


ggcctgttag 


tccccggctg 


tacacacagg 


ataaatgcat 


41280 


gcatgactgc 


aaagggagac 


ccttagtaac 


cacatcttgt 


gaccatattt 


tacagctcca 


41340 


tgattcctct 


tttcagcctc 


tggcaggaga 


gtttagtgtg 


agtgagacag 


tgaagaggag 


41400 


cagcaataac 


gtatctgttc 


ttggcttttc 


atctgataat 


ctctatgagg 


agttactaaa 


41460 


gcatctgagt 


ttatccattt 


aagtccactc 


tgtctgcagt 


gtaagtcccc 


agcttgtgcc 


41520 


actgctgtca 


ggagatgagt 


ctctccttga 


tcgatattta 


cttaacaaac 


agcagggatg 


41580 


ggagagtttg 


tttagaggaa 


tcatgtgcac 


tctagggtga 


atgaatgctc 


gggaaagtac 


41640 


ttcaactatt 


tgtctccttc 


cctaagattt 


ttgtgtacgt 


gtgtgtgcac 


acacgtgtgc 


41700 


agatgcccat 


tctcttttta 


acttctccaa agacacttcg 


aagtcatcta 


gaaaaatacc 


41760 


tcgctatgta 


tgattggtac 


atcattatac 


cgttaaggag 


ctaatgatgc 


agatgcagtt 


41820 


tttctaaccc 


agcaaagttt 


ggttcttctt ttgtgctctt 


atatagagca 


caaaagagac 


41880 


tcttaggata 


aactaaatgc 


acaagcatct acctttgacc 


cctttcagat 


gagtggaagg 


41940 


gaagaaaata 


cggatggaaa 


caataaaagc agtttgacaa 


ggcagctctt 


cactatgtat 


42000 


ttttgatggc 


attacctata 


tatttttaaa ggcccacagg 


gacaaaaagt 


aactttctcc 


42060 
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aatttttcag agctgcttca gcattagata tatttaactc tactactgta tatgaattcc 42120 

acggtgtgaa aattgagaga gcactgttct ttcgagttcc ctgaaacaat tgcttgaagg 42180 

ctcaagtcag cctcttgaat gcagttgact tggaggcatc tggggctaga tcgaggggtt 42240 

ttgtttctgg gtgtggggag aggctggggg gtggctgggg agttatttat ttatttgatt 42300 

ttgtgaatcg gagttgtaaa agccatctga aatattcatg cagaatagtc tgagaagccc 42360 

gtttctgttt tatttaccgc acagtagaac agccacagcg gattagttct acaatacccg 42420 

taacaaaagc ccaacagctg atgcatgtga tgttaggagg tgacaaaaca gttaaagtat 42480 

gctgctggct acaggcaagc agtcagcaga tgcagacaaa agggtttgtg acaagaataa 42540 

ctctctctcc aaggcgagca gtgaagagta tccaaaatac cagtaccctt ttctccttga 42600 

cattgtcttc ttacagtcag cattttattg cccttttata gtataaaaaa aaatggagga 42660 

ggaagaagaa ggaaaaccca cacacaaact aattcaccaa aatactaggc aggattgtac 42720 

tttcccattc gctagccatg cctgccagta cacgtgtcct tttccatttc tccatcgaag 42780 

caagtttgaa aaaaaaaatt agcttaaaag atcagctata aagatgattt cccttgaaaa 42840 

gtttgtaatc tattgatagg cttgataggc cattggagcc tttggttacg ggttgggggg 42900 

tgggtggcca gggaaagaag tcgatgcctg gtttgttttc tgtccatttc agtgaagatc 42960 

atttcagtga tgaaatgagg ccagagggcc aatttttaaa ggggattgag gagggaggag 43020 

tgtccatgga gaactgagca aggggcaagg tttaggtccc ccgcaagagg ctgatgaatg 43080 

agcttacgga cggttcagag gtgtgaaaaa tgagcttctc tgtctccaga aaataggaga 43140 

ggctgtcttc tttttaacct ttgtaattcc ccttctattc tctgtgacat tcattcagct 43200 

gccaagagcg tttggcaagg tttgggccag cgagcacact tccagtgacc gctaaccttg 43260 

gtatgtcctg acacttatga tgagtatctg caggacacag aaggcaggca gcctgctatg 43320 

tcaggctttt attatgtact gcagaggcta gggacagtca gtttaataaa acaaatcatc 43380 

cttgaaggta aagcaactgg gaagaggagg aagacaggag aaaaatgtgt ctttgccact 43440 

cattccgatg gaaaaaaaaa agaacagcaa aacaaccacc cacccaacac accgtgtgtg 43500 

tgtgtgtgtg tgtgtgtgtg tgtgtgcgcg cgcgcgcatt cgcgcacgca cacacacgcg 43560 

caacccagct gtggactggg cagacttgaa aacctcctct cattttctgc atttcatgga 43620 

agcccagaag gctcttgttt gctctgagga gactcaagtc tgtgatgaaa ttggtagaag 43680 

ctgatagcca acccccttca aatttatgca tatcttcaag tacctcatta ctttatattc 43740 

ttctccaaat atcaaggcaa gaccatctgg ggtgacgttc ctatattggg atgccttttt 43800 

atcaaaacaa agtttccact ctcctctcct gaggaacgct gggcaaagca gctcccacaa 43860 

tagcctcaga gttccagcca aagactttgg aagccttttg ttttttccct gtggcatgtc 43920 

caaaggcagg gccttctccc ctcctccgcc cgccctcccc agccgcctgc attgtcttgc 43980 

attccagtga cttgattgac tgttaccacc tgatgctgag gagatactct agggttcatt 44040 

ctgcagattg ttgggttcta ttaaaagaaa cctagataag ggattacttg tcactaaggg 44100 
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attttctgca gatgtttatt ggtgatggga aagccattag gtgtgaagag gtgcagaaaa 44160 

atatggacaa catcattctg ataagactgg tttctaagat gctcccacaa aacatcagaa 44220 

agtaccccct attattctgt taaatggagc tgggtgtttt caagcagagg taaaggtctc 44280 

tttttccatg ggtgatgttt ctatgtgtgg atgaaattca ctggaaccct ctcagaagat 44340 

cagttgctac ccaaaagtgt acctctggga gccaccaaac acatgagttg ctccagtagt 44400 

tcagtatctc attacaactt tcttttgtcc agtccagtcc attgcatgag tatcacctca 44460 

aagtaagcac tatattaact aatcatttta tttgttcaca aagaattcat ttcttcccaa 44520 

atataaacca ataaccaaag tctcctccag ggcatctttt ataccatttc catttatttt 44580 

gaagttacta gattctctgt ggtttttcaa gattacagag gcacagcttt tcaaggtttt 44640 

ggtgcctcat ataaatagta gaaattgctg aaaaagcatt aaaagggagc cagcatcgtt 44700 

taatgcaaag acaccttacc tcacagtaat ctcttcatct catcatttct tcatctcata 44760 

caatctcatg ctttcttcat ctataaagtg atgatttctg agatctattc gaactctttg 44820 

aattctacct tactttacca ttattttaaa cttctttttt tttttttatt tttgagatgg 44880 

ggtctcactg tcacccaggc tgtagtgcaa tggtgcaacc tcagctcact gcaacctccg 44940 

ccacctgggc tcaagccatc ctctcacctc cacctcccag tagctgggac cacaggcatg 45000 

tgccaccaca cccagctaat ttttttgcat ttttggtaga gacggggttt catcatgttg 45060 

ctcaggctga agcttccctt tattaagtat tgttaaagta ttaagtaact gccactctag 45120 

agcaatatgg agtaaagcag aaggcaagat ctcactatga gctatttacc aaataacttt 45180 

gcaaaagata ctctgctgag gctccttatc tagagacacc ttatgatgag gtaattgaaa 45240 

gtacataaaa gtagataaaa agttaaacag catcaagaca caaatgcaaa aggtgataaa 45300 

ggataaccta tgattgccac cacaagaaag gaatatttaa aacagattaa aacccactaa 45360 

aaaccattaa caagcatgac gaactataaa aatgatgaag aggagactgc atacaacccc 45420 

caaagaagtt gccttgttct catgcaaatc ctacaactac acttccctcc ctcccctgct 45480 

gctgatgttc tagatgtacc tcttctctct cctctgacag tcttgaacaa tgcctgccct 45540 

tcccctgtcc ctggttcccc agacctcctg tgcagttctt ggtgtgggca gggcttccgg 45600 

ccttctctgg cttctctggg gcagctgccc acaccttcac ccctcaaagc tctctgccat 45660 

gtcatgctgc atccctgagt gctcaaggaa catagaattt cactgaggct gtattgccgt 45720 

tggctgatga aaccaccctt cttgaaacgt ttattttaat aaatgcctat aattggccag 45780 

gtgcagtggc tcacacctgt aatctcagca ctttgggagg ccaagacggg cagatcacct 45840 

gaggttggga gttggagacc agcctggcca acatggtgaa accccatctc tactgaaaat 45900 

acaaaagtta gccaggcgtg atggcacttg cctgtaatct cagctactca ggaggctaag 45960 

gcaggagagt ctcatgaacc caggaggcag aggttgcagt gagccaggat catgccactg 46020 

cactccagcc tgggtgacag agcaaaactc catctcaaat aaataaatta ataaatgcct 46080 

atgattatgt ttctgtagca tttggctaac agctcccaat ccaaggagtg agagtgggca 46140 
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gttgctccgc 


ttcactgttc 


tccagccaca ttccctccct 


cagtgatgct 


catttgatag 


46200 


aatgtggagg attatctttg ggggtggagg tgactgtgct agaaaagatt gcttcacgaa 46260 

tttttatttg tataatgtga gtgggagggc taagctctcc tccaacaaat actcatgtat 46320 

acaagacatt tgggaggaaa tcacccaaag gcctgtagaa aatccacatg aattctcagc 46380 

agagaatggc ccttgaggtg tatgggtttg cacattcatg gcggacaagg cggcactttg 46440 

aaggattttc caggcaacac tgggaattat gtcctaagaa atgggccagt gtgaaagtct 46500 

ttaggagggt ctgataaaaa tgtaagctta agactgattg gccccaaaag gagtcccttt 46560 

catttttttc tgcagagtta ttacatttct ttataaacaa caattaactt gccataggga 46620 

acaatgaact tctttgtcca attttaaacg tgaaaaacag tgatgtcggg tgatgattct 46680 

ggttttcttt accagttact actattgtta aaaagtacat tgcacccaag gtgggaagaa 46740 

agagatgaaa catgttcaac attacactac ttccttttta ctttggtacg tggcatgtct 46800 

gaacttagat gaaatgtctt tcatctcttg tatatgcgta gataaatatg gctacatgta 46860 

cacctatgat acgtttatgt cctcatacgt ctgcacttaa tgtaaaaatg aaactttact 46920 

ggtgtataag taccccacta aaagaaatct actaagtgtc aatgtgtact tggaaaatca 46980 

tgagttcatg gattattctg tgattccatt atgttggtgt ggggatagat agaccatgct 47040 

gtactataag taacttccaa agaacactaa ataagtacat cagtagctac tgctttcctt 47100 

agtcaagaga tcagattaat aagtaattaa gagaacacac acacacacac aacacacata 47160 

catattaatt gctgtggaag aaaagcctta agaaattggg gttctaaaat gaatatttgg 47220 

ggaatgttta ttttggatga taaggacctt gaggaatttc cttaccctct ctgagcctca 47280 

gttttctatt gtgtaactgg gataataaca ccccttagag agattgggag aactgaatga 47340 

cataattcac attcagtaca taaaacatag cctggcaagt agtaaatact cgaaaaaagt 47400 

tagtttgtat tattattatt atcagctgaa taaatcactc tcttatggag caattctaat 47460 

ctcaaggtta agtagtttct gatgtaatat tttaggatca gttttgtgac ttcatgttaa 47520 

tattattatt ttactccttt atgtatatag aatactttat attgcagatt aatatacaac 47580 

ttagcatctg agtcaacaat cctctgagac aaacagataa ctgagatttt agaagatttt 47640 

cttcatttaa agcttgggtt taatttataa agaagcccaa ctatttgtta ttctattttg 47700 

agaacgtatt ttgttttcat catggcaatc aaaaagaaat aggattcaaa ttctgaaaaa 47760 

ataattggag actttcttct ggatagcact tatttaataa agtgaggaat cccaaaagtc 47820 

acatcccata ttcctatcct aatatccaca atgaaatccc agtttttcaa taggtctgcg 47880 

ttggatcttt catacactct tcttaaaaca aagctgtcaa ccccacatca caatgcttct 47940 

atatataatg actttacatt aaaagaatag aagccagcta tttttagaaa atgcaggtgc 48000 

catgtaagcc cctttctgca agaatgatct tagctcagtt tccttggaat aactgtagac 48060 

ttgaaactga aaactttatt aatgccattg tctccttgta tcagcaggtt ccagagagat 48120 

tcctggaagt tgctcagatc acattacggg agtttttcaa tgccattatc gcaggcaaag 48180 

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atgttgatcc ttcctggaag aaggccatat acaaggtcat ctgcaagctg gatagtgaag 48240 

tccctgagat tttcaaatcc ccgaactgcc tacaagagct gcttcatgag tagaaatttc 48300 

aacaactctt tttgaatgta tgaagagtag cagtcccctt tggatgtcca agttatatgt 48360 

gtctagattt tgatttcata tatatgtgta tgggaggcat ggatatgtta tgaaatcagc 48420 

tggtaattcc tcctcatcac gtttctctca ttttcttttg ttttccattg caaggggatg 48480 

gttgttttct ttctgccttt agtttgcttt tgcccaaggc ccttaacatt tggacactta 48540 

aaatagggtt aattttcagg gaaaaagaat gttggcgtgt gtaaagtctc tattagcaat 48600 

gaagggaatt tgttaacgat gcatccactt gattgatgac ttattgcaaa tggcggttgg 48660 

ctgaggaaaa cccatgacac agcacaactc tacagacagt gatgtgtctc ttgtttctac 48720 

tgctaagaag gtctgaaaat ttaatgaaac cacttcatac atttaagtat tttgtttggt 48780 

ttgaactcaa tcagtagctt ttccttacat gtttaaaaat aattccaatg acagatgagc 48840 

agctcacttt tccaaagtac cccaaaaggc caaattaaaa aagaaaaata atcactctca 48900 

agccttgtct aagaaaagag gcaaactctg aaagtcgtac cagtttcttc tggaggcaaa 48960 

gcaattttgc acaaaaccag ctctctcaag atgagactag aaattcatac ctggtcttgt 49020 

agccacctct ctaaacttga aaataggttc ttcttcataa gtgagcttac atcattcttc 49080 

ataaagaaaa atcctataac ttgttatcat ttttgcttca gatactaaaa ggcactaagt 49140 

ttccaattta cgctgctcaa ctttgtttat atgcttaaaa ggattctgtt tacttaacaa 49200 

ttttttcccc taaaatacta ttttctgaat acttccttcc agtaaggaat aaaggaaagc 49260 

ccaacttggc cataa 49275 

<210> 2 

<211> 3097 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Prox-1 DNA 

<400> 2 




ggcacgaggc cccttttcca gaatcacttg cactgtcttg ttcttgaatg agaaaggaag 60 

aaaagagcct cccattactc agacccgtgt aaacattatt ccccccagga gaaaatggtg 120 

ttattcaaat gaatcataat aaaatagcct ctaaacagtt tctaagcggg agcctccgtg 180 

gaactcagcg ctccgctcct cccagttcct aagaggtccc gggattcttg agctgtgccc 240 

agctgacgag cttttgaaga tggcacaata accgtccagt gatgcctgac catgacagca 300 

cagccctctt aagccggcaa accaagagga gaagagttga cattggagtg aaaaggacgg 360 

tagggacagc atctgcattt tttgctaagg caagagcaac gttttttagt gccatgaatc 420 

cccaaggttc tgagcaggat gttgagtatt cagtggtgca gcatgcagat ggggaaaagt 480 

caaatgtact ccgcaagctg ctgaagaggg cgaactcgta tgaagatgcc atgatgcctt 540 

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ttccaggagc 


aacca~taat~t 



catigaacaaa 




600 


9 9 ^ 9 c c c 39 


tttccaagcc 


agcggtctct ctagt^cagg 


ctccgaagta 


cltc!gglgg 


660 


atatatgcsg 


caactcttca 


agagacagcc cccca.gag tg 


tctttcccct 


"tttggcaggc 


720 


c "t a. c t a. "t 9 3. 9 


ccagtttgat 


atggatcgct tatgtgatga 


gcacctgaga 


geaaagegeg 


780 


cccgggttga 


gaatataatt 


egg gg tatga gccattcccc 


cagtgtggca 


"ttaaggggca 


840 








ccgagaaagt 


tacagagaaa 


900 


acalacgcal 


gcl^gctt 


!cccagcagc Ig^wclgag 


tttccagcag 


ctiggtittcag 


960 


c cc 9 3.3.3.3.93. 


acagaagcga 


gag gage 9 cc gacagctgaa 


acagcagctg 


gaggacatgc 


1020 


3. 9 3 3. 3. C 3. 9 C "t 


gcgccagctg 


caggaaaagt tctaccaaat 


ctatgacagc 


actgattegg 


1080 


aaaatgatga 


agatggtaac 


ctgtctgaag acagcatgeg 


ct egg agate 


ctggatgcca 


1140 


gggcccagga 


ctctgtcgga 






ccaggacagt 


1200 


ttattgaccg 




rtgat^gag ^clgg^t 


ggctgaaaac 


aagecgaage 


1260 


gagaaggcaa 


caacaaagaa 


3-93-9 acc at g ggecaaact c 


cttacaaccg 


gaaggcaaac 


1320 


atttggctga 




caggaactga acactgccat 


gtcgcaagtt 


gtggacactg 


1380 


tggtcaaagt 


ctttt cggcc 


aagcect ccc g ccaggttcc 


tcaggtcttc 


ccacctctcc 


1440 


agatccccca 




gcagt caatg gggaaaacca 


caatttccac 


accgccaacc 


1500 


agcgcctgca 


g t g c 1 t ~t g g c 


gaegt cat ca t tccgaaccc 


cctggacacc 


tttggcaatg 


1560 


tgcagatggc 




gaccagacag aagcactgcc 


cctggttgtc 


cgcaaaaact 


1620 




gtctgcctcc 


ggccctgccg ctggcggcca 


ccaccagccc 


ctgcaccagt 


1680 


cgcctct etc 


tgccaccacg 


ggctt caeca cgtccacctt 


ccgccacccc 


ttcccccttc 


1740 


ccttgatggc 


ctatccattt 


cagagcccat taggtgetec 


ctccggctcc 


ttctctggaa 


1800 


aaga.caga.gc 


ctctcctgaa 


"tccttagact taactaggga 


taccacgagt 


ctgaggacca 


1860 


agatgtcatc 




agccaccacc cttgtt cacc 


agcacacccg 


cccagcaccg 


1920 


ccgaagggct 


ctcct"tg"tcg 


ctcataaagt ccgagtgcgg 


cgatcttcaa 


gatatgtctg 


1980 


aaatatcacc 




^■9 ^-9 caatgc aggaaggatt 


gtcacccaat 


cacttgaaaa 


2040 


aagcaaagct 


catgtttttt 


tataccegtt atcccagct c 


caatatgetg 


aagacctact 


2100 


tctccgacgt 




agatgeatta cctct cagct 


catcaagtgg 


tttagcaatt 


2160 


tccgtgagtt 


ttactacatt 


cagatggaga agtacgcacg 


teaagecatx 


aacgatgggg 


2220 


t caccagtac 


tgaagagctg 


tctataacca gagactgtga 


gctgtacagg 


gctctgaaca 


2280 




taaagcaaat 


gactttgagg ttccagagag 


attcctggaa 


gttgetcaga 


2340 


tcacattacg ggagtttttc aatgccatta tcgcaggcaa agatgttgat ccttcctgga 2400 

agaaggccat atacaaggtc atctgcaagc tggatagtga agtccctgag attttcaaat 2460 

ccccgaactg cctacaagag ctgcttcatg agtagaaatt tcaacaactc tttttgaatg 2520 

tatgaagagt agcagtcccc tttggatgtc caagttatat gtgtctagat tttgatttca 2580 
tatatatgtg tatgggaggc atggatatgt tatgaaatca gctggtaatt cctcctcatc 2640

acgtttctct cattttcttt tgttttccat tgcaagggga tggttgtttt ctttctgcct 2700 

ttagtttgct tttgcccaag gcccttaaca tttggacact taaaataggg ttaattttca 2760 

gggaaaaaga atgttggcgt gtgtaaagtc tctattagca atgaagggaa tttgttaacg 2820 

atgcatccac ttgattgatg acttattgca aatggcggtt ggctgaggaa aacccatgac 2880 

acagcacaac tctacagaca gtgatgtgtc tcttgtttct actgctaaga aggtctgaaa 2940 

atttaatgaa accacttcat acatttaagt attttgtttg gtttgaactc aatcagtagc 3000 

ttttccttac atgtttaaaa ataattccaa tgacagatga gcagctcact tttccaaagt 3060 

accccaaaag gccaaattaa aaaaaaaaaa aaaaaaa 3097 

<210> 3 

<211> 737 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Prox-1 Protein 

<400> 3 

Met Pro Asp His Asp Ser Thr Ala Leu Leu Ser Arg Gln Thr Lys Arg
1 5 10 15

Arg Arg Val Asp Ile Gly Val Lys Arg Thr Val Gly Thr Ala Ser Ala
20 25 30

Phe Phe Ala Lys Ala Arg Ala Thr Phe Phe Ser Ala Met Asn Pro Gln
35 40 45

Gly Ser Glu Gln Asp Val Glu Tyr Ser Val Val Gln His Ala Asp Gly
50 55 60

Glu Lys Ser Asn Val Leu Arg Lys Leu Leu Lys Arg Ala Asn Ser Tyr
65 70 75 80

Glu Asp Ala Met Met Pro Phe Pro Gly Ala Thr Ile Ile Ser Gln Leu
85 90 95

Leu Lys Asn Asn Met Asn Lys Asn Gly Gly Thr Glu Pro Ser Phe Gln
100 105 110

Ala Ser Gly Leu Ser Ser Thr Gly Ser Glu Val His Gln Glu Asp Ile
115 120 125

Cys Ser Asn Ser Ser Arg Asp Ser Pro Pro Glu Cys Leu Ser Pro Phe
130 135 140

Gly Arg Pro Thr Met Ser Gln Phe Asp Met Asp Arg Leu Cys Asp Glu
145 150 155 160

His Leu Arg Ala Lys Arg Ala Arg Val Glu Asn Ile Ile Arg Gly Met
165 170 175

Ser His Ser Pro Ser Val Ala Leu Arg Gly Asn Glu Asn Glu Arg Glu
180 185 190

Met Ala Pro Gln Ser Val Ser Pro Arg Glu Ser Tyr Arg Glu Asn Lys
195 200 205

Arg Lys Gln Lys Leu Pro Gln Gln Gln Gln Gln Ser Phe Gln Gln Leu
210 215 220

Val Ser Ala Arg Lys Glu Gln Lys Arg Glu Glu Arg Arg Gln Leu Lys
225 230 235 240

Gln Gln Leu Glu Asp Met Gln Lys Gln Leu Arg Gln Leu Gln Glu Lys
245 250 255

Phe Tyr Gln Ile Tyr Asp Ser Thr Asp Ser Glu Asn Asp Glu Asp Gly
260 265 270

Asn Leu Ser Glu Asp Ser Met Arg Ser Glu Ile Leu Asp Ala Arg Ala
275 280 285

Gln Asp Ser Val Gly Arg Ser Asp Asn Glu Met Cys Glu Leu Asp Pro
290 295 300

Gly Gln Phe Ile Asp Arg Ala Arg Ala Leu Ile Arg Glu Gln Glu Met
305 310 315 320

Ala Glu Asn Lys Pro Lys Arg Glu Gly Asn Asn Lys Glu Arg Asp His
325 330 335

Gly Pro Asn Ser Leu Gln Pro Glu Gly Lys His Leu Ala Glu Thr Leu
340 345 350

Lys Gln Glu Leu Asn Thr Ala Met Ser Gln Val Val Asp Thr Val Val
355 360 365

Lys Val Phe Ser Ala Lys Pro Ser Arg Gln Val Pro Gln Val Phe Pro
370 375 380

Pro Leu Gln Ile Pro Gln Ala Arg Phe Ala Val Asn Gly Glu Asn His
385 390 395 400

Asn Phe His Thr Ala Asn Gln Arg Leu Gln Cys Phe Gly Asp Val Ile
405 410 415

Ile Pro Asn Pro Leu Asp Thr Phe Gly Asn Val Gln Met Ala Ser Ser
420 425 430

Thr Asp Gln Thr Glu Ala Leu Pro Leu Val Val Arg Lys Asn Ser Ser
435 440 445

Asp Gln Ser Ala Ser Gly Pro Ala Ala Gly Gly His His Gln Pro Leu
450 455 460

His Gln Ser Pro Leu Ser Ala Thr Thr Gly Phe Thr Thr Ser Thr Phe
465 470 475 480

Arg His Pro Phe Pro Leu Pro Leu Met Ala Tyr Pro Phe Gln Ser Pro
485 490 495

Leu Gly Ala Pro Ser Gly Ser Phe Ser Gly Lys Asp Arg Ala Ser Pro
500 505 510

Glu Ser Leu Asp Leu Thr Arg Asp Thr Thr Ser Leu Arg Thr Lys Met
515 520 525

Ser Ser His His Leu Ser His His Pro Cys Ser Pro Ala His Pro Pro
530 535 540

Ser Thr Ala Glu Gly Leu Ser Leu Ser Leu Ile Lys Ser Glu Cys Gly
545 550 555 560

Asp Leu Gln Asp Met Ser Glu Ile Ser Pro Tyr Ser Gly Ser Ala Met
565 570 575

Gln Glu Gly Leu Ser Pro Asn His Leu Lys Lys Ala Lys Leu Met Phe
580 585 590

Phe Tyr Thr Arg Tyr Pro Ser Ser Asn Met Leu Lys Thr Tyr Phe Ser
595 600 605

Asp Val Lys Phe Asn Arg Cys Ile Thr Ser Gln Leu Ile Lys Trp Phe
610 615 620

Ser Asn Phe Arg Glu Phe Tyr Tyr Ile Gln Met Glu Lys Tyr Ala Arg
625 630 635 640

Gln Ala Ile Asn Asp Gly Val Thr Ser Thr Glu Glu Leu Ser Ile Thr
645 650 655

Arg Asp Cys Glu Leu Tyr Arg Ala Leu Asn Met His Tyr Asn Lys Ala
660 665 670

Asn Asp Phe Glu Val Pro Glu Arg Phe Leu Glu Val Ala Gln Ile Thr
675 680 685

Leu Arg Glu Phe Phe Asn Ala Ile Ile Ala Gly Lys Asp Val Asp Pro
690 695 700

Ser Trp Lys Lys Ala Ile Tyr Lys Val Ile Cys Lys Leu Asp Ser Glu
705 710 715 720

Val Pro Glu Ile Phe Lys Ser Pro Asn Cys Leu Gln Glu Leu Leu His
725 730 735 



<210> 4 

<211> 21 

<212> RNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Prox-1 A16 sense 

<400> 4 

cugcaagcug gauagugaag u 21



<210> 5 

<211> 21 

<212> RNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Prox-1 A16 antisense

<400> 5

uucacuaucc agcuugcaga u 21



<210> 6 

<211> 21 

<212> RNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Prox-1 A25 sense 

<400> 6 

cuaugagcca guuugauauu u 21



<210> 7 

<211> 21 

<212> RNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Prox-1 A25 antisense

<400> 7

auaucaaacu ggcucauagu u 21

<210> 8 

<211> 21 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> EGFP A18 sense

<400> 8 

gacguaaacg gccacaaguu u 21 



<210> 9 

<211> 21 

<212> RNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> EGFP A18 antisense

<400> 9

acuuguggcc guuuacgucu u 21



<210> 10 

<211> 3362 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Beta-catenin 

<400> 10



aagcctctcg gtctgtggca gcagcgttgg cccggccccg ggagcggaga gcgaggggag 60

gcggagacgg aggaaggtct gaggagcagc ttcagtcccc gccgagccgc caccgcaggt 120

cgaggacggt cggactcccg cggcgggagg agcctgttcc cctgagggta tttgaagtat 180

accatacaac tgttttgaaa atccagcgtg gacaatggct actcaagctg atttgatgga 240

gttggacatg gccatggaac cagacagaaa agcggctgtt agtcactggc agcaacagtc 300

ttacctggac tctggaatcc attctggtgc cactaccaca gctccttctc tgagtggtaa 360

aggcaatcct gaggaagagg atgtggatac ctcccaagtc ctgtatgagt gggaacaggg 420

attttctcag tccttcactc aagaacaagt agctgatatt gatggacagt atgcaatgac 480

tcgagctcag agggtacgag ctgctatgtt ccctgagaca ttagatgagg gcatgcagat 540

cccatctaca cagtttgatg ctgctcatcc cactaatgtc cagcgtttgg ctgaaccatc 600

acagatgctg aaacatgcag ttgtaaactt gattaactat caagatgatg cagaacttgc 660

cacacgtgca atccctgaac tgacaaaact gctaaatgac gaggaccagg tggtggttaa 720

taaggctgca gttatggtcc atcagctttc taaaaaggaa gcttccagac acgctatcat 780
gcgttctcct cagatggtgt ctgctattgt acgtaccatg cagaatacaa atgatgtaga 840

aacagctcgt tgtaccgctg ggaccttgca taacctttcc catcatcgtg agggcttact 900

ggccatcttt aagtctggag gcattcctgc cctggtgaaa atgcttggtt caccagtgga 960

ttctgtgttg ttttatgcca ttacaactct ccacaacctt ttattacatc aagaaggagc 1020

taaaatggca gtgcgtttag ctggtgggct gcagaaaatg gttgccttgc tcaacaaaac 1080

aaatgttaaa ttcttggcta ttacgacaga ctgccttcaa attttagctt atggcaacca 1140

agaaagcaag ctcatcatac tggctagtgg tggaccccaa gctttagtaa atataatgag 1200

gacctatact tacgaaaaac tactgtggac cacaagcaga gtgctgaagg tgctatctgt 1260

ctgctctagt aataagccgg ctattgtaga agctggtgga atgcaagctt taggacttca 1320

cctgacagat ccaagtcaac gtcttgttca gaactgtctt tggactctca ggaatctttc 1380

agatgctgca actaaacagg aagggatgga aggtctcctt gggactcttg ttcagcttct 1440

gggttcagat gatataaatg tggtcacctg tgcagctgga attctttcta acctcacttg 1500

caataattat aagaacaaga tgatggtctg ccaagtgggt ggtatagagg ctcttgtgcg 1560

tactgtcctt cgggctggtg acagggaaga catcactgag cctgccatct gtgctcttcg 1620

tcatctgacc agccgacacc aagaagcaga gatggcccag aatgcagttc gccttcacta 1680

tggactacca gttgtggtta agctcttaca cccaccatcc cactggcctc tgataaaggc 1740

tactgttgga ttgattcgaa atcttgccct ttgtcccgca aatcatgcac ctttgcgtga 1800

gcagggtgcc attccacgac tagttcagtt gcttgttcgt gcacatcagg atacccagcg 1860

ccgtacgtcc atgggtggga cacagcagca atttgtggag ggggtccgca tggaagaaat 1920

agttgaaggt tgtaccggag cccttcacat cctagctcgg gatgttcaca accgaattgt 1980

tatcagagga ctaaatacca ttccattgtt tgtgcagctg ctttattctc ccattgaaaa 2040

catccaaaga gtagctgcag gggtcctctg tgaacttgct caggacaagg aagctgcaga 2100

agctattgaa gctgagggag ccacagctcc tctgacagag ttacttcact ctaggaatga 2160

aggtgtggcg acatatgcag ctgctgtttt gttccgaatg tctgaggaca agccacaaga 2220

ttacaagaaa cggctttcag ttgagctgac cagctctctc ttcagaacag agccaatggc 2280

ttggaatgag actgctgatc ttggacttga tattggtgcc cagggagaac cccttggata 2340

tcgccaggat gatcctagct atcgttcttt tcactctggt ggatatggcc aggatgcctt 2400

gggtatggac cccatgatgg aacatgagat gggtggccac caccctggtg ctgactatcc 2460

agttgatggg ctgccagatc tggggcatgc ccaggacctc atggatgggc tgcctccagg 2520

tgacagcaat cagctggcct ggtttgatac tgacctgtaa atcatccttt agctgtattg 2580

tctgaacttg cattgtgatt ggcctgtaga gttgctgaga gggctcgagg ggtgggctgg 2640

tatctcagaa agtgcctgac acactaacca agctgagttt cctatgggaa caattgaagt 2700

aaactttttg ttctggtcct ttttggtcga ggagtaacaa tacaaatgga ttttgggagt 2760

gactcaagaa gtgaagaatg cacaagaatg gatcacaaga tggaatttag caaaccctag 2820

ccttgcttgt taaaattttt tttttttttt ttttaagaat atctgtaatg gtactgactt 2880

tgcttgcttt gaagtagctc tttttttttt tttttttttt tttttttgca gtaactgttt 2940 

tttaagtctc tcgtagtgtt aagttatagt gaatactgct acagcaattt ctaattttta 3000 

agaattgagt aatggtgtag aacactaatt aattcataat cactctaatt aattgtaatc 3060 

tgaataaagt gtaacaattg tgtagccttt ttgtataaaa tagacaaata gaaaatggtc 3120 

caattagttt cctttttaat atgcttaaaa taagcaggtg gatctatttc atgtttttga 3180 

tcaaaaacta tttgggatat gtatgggtag ggtaaatcag taagaggtgt tatttggaac 3240 

cttgttttgg acagtttacc agttgccttt tatcccaaag ttgttgtaac ctgctgtgat 3300 

acgatgcttc aagagaaaat gcggttataa aaaatggttc agaattaaac ttttaattca 3360 



<210> 11 

<211> 781 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Beta-catenin 



<400> 11 

Met Ala Thr Gln Ala Asp Leu Met Glu Leu Asp Met Ala Met Glu Pro
1 5 10 15

Asp Arg Lys Ala Ala Val Ser His Trp Gln Gln Gln Ser Tyr Leu Asp
20 25 30

Ser Gly Ile His Ser Gly Ala Thr Thr Thr Ala Pro Ser Leu Ser Gly
35 40 45

Lys Gly Asn Pro Glu Glu Glu Asp Val Asp Thr Ser Gln Val Leu Tyr
50 55 60

Glu Trp Glu Gln Gly Phe Ser Gln Ser Phe Thr Gln Glu Gln Val Ala
65 70 75 80

Asp Ile Asp Gly Gln Tyr Ala Met Thr Arg Ala Gln Arg Val Arg Ala
85 90 95

Ala Met Phe Pro Glu Thr Leu Asp Glu Gly Met Gln Ile Pro Ser Thr
100 105 110

Gln Phe Asp Ala Ala His Pro Thr Asn Val Gln Arg Leu Ala Glu Pro
115 120 125

Ser Gln Met Leu Lys His Ala Val Val Asn Leu Ile Asn Tyr Gln Asp
130 135 140
Asp Ala Glu Leu Ala Thr Arg Ala Ile Pro Glu Leu Thr Lys Leu Leu
145 150 155 160

Asn Asp Glu Asp Gln Val Val Val Asn Lys Ala Ala Val Met Val His
165 170 175

Gln Leu Ser Lys Lys Glu Ala Ser Arg His Ala Ile Met Arg Ser Pro
180 185 190

Gln Met Val Ser Ala Ile Val Arg Thr Met Gln Asn Thr Asn Asp Val
195 200 205

Glu Thr Ala Arg Cys Thr Ala Gly Thr Leu His Asn Leu Ser His His
210 215 220

Arg Glu Gly Leu Leu Ala Ile Phe Lys Ser Gly Gly Ile Pro Ala Leu
225 230 235 240

Val Lys Met Leu Gly Ser Pro Val Asp Ser Val Leu Phe Tyr Ala Ile
245 250 255

Thr Thr Leu His Asn Leu Leu Leu His Gln Glu Gly Ala Lys Met Ala
260 265 270

Val Arg Leu Ala Gly Gly Leu Gln Lys Met Val Ala Leu Leu Asn Lys
275 280 285

Thr Asn Val Lys Phe Leu Ala Ile Thr Thr Asp Cys Leu Gln Ile Leu
290 295 300

Ala Tyr Gly Asn Gln Glu Ser Lys Leu Ile Ile Leu Ala Ser Gly Gly
305 310 315 320

Pro Gln Ala Leu Val Asn Ile Met Arg Thr Tyr Thr Tyr Glu Lys Leu
325 330 335

Leu Trp Thr Thr Ser Arg Val Leu Lys Val Leu Ser Val Cys Ser Ser
340 345 350

Asn Lys Pro Ala Ile Val Glu Ala Gly Gly Met Gln Ala Leu Gly Leu
355 360 365

His Leu Thr Asp Pro Ser Gln Arg Leu Val Gln Asn Cys Leu Trp Thr
370 375 380

Leu Arg Asn Leu Ser Asp Ala Ala Thr Lys Gln Glu Gly Met Glu Gly
385 390 395 400

Leu Leu Gly Thr Leu Val Gln Leu Leu Gly Ser Asp Asp Ile Asn Val
405 410 415
Val Thr Cys Ala Ala Gly Ile Leu Ser Asn Leu Thr Cys Asn Asn Tyr
420 425 430

Lys Asn Lys Met Met Val Cys Gln Val Gly Gly Ile Glu Ala Leu Val
435 440 445

Arg Thr Val Leu Arg Ala Gly Asp Arg Glu Asp Ile Thr Glu Pro Ala
450 455 460

Ile Cys Ala Leu Arg His Leu Thr Ser Arg His Gln Glu Ala Glu Met
465 470 475 480

Ala Gln Asn Ala Val Arg Leu His Tyr Gly Leu Pro Val Val Val Lys
485 490 495

Leu Leu His Pro Pro Ser His Trp Pro Leu Ile Lys Ala Thr Val Gly
500 505 510

Leu Ile Arg Asn Leu Ala Leu Cys Pro Ala Asn His Ala Pro Leu Arg
515 520 525

Glu Gln Gly Ala Ile Pro Arg Leu Val Gln Leu Leu Val Arg Ala His
530 535 540

Gln Asp Thr Gln Arg Arg Thr Ser Met Gly Gly Thr Gln Gln Gln Phe
545 550 555 560

Val Glu Gly Val Arg Met Glu Glu Ile Val Glu Gly Cys Thr Gly Ala
565 570 575

Leu His Ile Leu Ala Arg Asp Val His Asn Arg Ile Val Ile Arg Gly
580 585 590

Leu Asn Thr Ile Pro Leu Phe Val Gln Leu Leu Tyr Ser Pro Ile Glu
595 600 605

Asn Ile Gln Arg Val Ala Ala Gly Val Leu Cys Glu Leu Ala Gln Asp
610 615 620

Lys Glu Ala Ala Glu Ala Ile Glu Ala Glu Gly Ala Thr Ala Pro Leu
625 630 635 640

Thr Glu Leu Leu His Ser Arg Asn Glu Gly Val Ala Thr Tyr Ala Ala
645 650 655

Ala Val Leu Phe Arg Met Ser Glu Asp Lys Pro Gln Asp Tyr Lys Lys
660 665 670

Arg Leu Ser Val Glu Leu Thr Ser Ser Leu Phe Arg Thr Glu Pro Met
675 680 685
Ala Trp Asn Glu Thr Ala Asp Leu Gly Leu Asp Ile Gly Ala Gln Gly
690 695 700

Glu Pro Leu Gly Tyr Arg Gln Asp Asp Pro Ser Tyr Arg Ser Phe His
705 710 715 720

Ser Gly Gly Tyr Gly Gln Asp Ala Leu Gly Met Asp Pro Met Met Glu
725 730 735

His Glu Met Gly Gly His His Pro Gly Ala Asp Tyr Pro Val Asp Gly
740 745 750

Leu Pro Asp Leu Gly His Ala Gln Asp Leu Met Asp Gly Leu Pro Pro
755 760 765

Gly Asp Ser Asn Gln Leu Ala Trp Phe Asp Thr Asp Leu
770 775 780

<210> 12 

<211> 2500 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> TCF-4 

<400> 12 

cggggggatc ttggctgtgt gtctgcggat ctgtagtggc ggcggcggcg gcggcggcgg 60 

ggaggcagca ggcgcgggag cgggcgcagg agcaggcggc ggcggtggcg gcggcggtta 120 

gacatgaacg ccgcctcggc gccggcggtg cacggagagc cccttctcgc gcgcgggcgg 180 

tttgtgtgat tttgctaaaa tgcatcacca acagcgaatg gctgccttag ggacggacaa 240 

agagctgagt gatttactgg atttcagtgc gatgttttca cctcctgtga gcagtgggaa 300 

aaatggacca acttctttgg caagtggaca ttttactggc tcaaatgtag aagacagaag 360 

tagctcaggg tcctggggga atggaggaca tccaagcccg tccaggaact atggagatgg 420 

gactccctat gaccacatga ccagcaggga ccttgggtca catgacaatc tctctccacc 480 

ttttgtcaat tccagaatac aaagtaaaac agaaaggggc tcatactcat cttatgggag 540 

agaatcaaac ttacagggtt gccaccagca gagtctcctt ggaggtgaca tggatatggg 600 

caacccagga accctttcgc ccaccaaacc tggttcccag tactatcagt attctagcaa 660 

taatccccga aggaggcctc ttcacagtag tgccatggag gtacagacaa agaaagttcg 720 

aaaagttcct ccaggtttgc catcttcagt ctatgctcca tcagcaagca ctgccgacta 780 

caatagggac tcgccaggct atccttcctc caaaccagca accagcactt tccctagctc 840 

cttcttcatg caagatggcc atcacagcag tgacccttgg agctcctcca gtgggatgaa 900 

tcagcctggc tatgcaggaa tgttgggcaa ctcttctcat attccacagt ccagcagcta 960 
ctgtagcctg

aacgtttgag ctatccatca




1020 


ttrrantrtt 


cctccgatgt 


ccactttcca tcgtagtggt 


acaaaccatt 


acagcacctc 


1080 


ttcctgtacg 






g caaatag ag 


gaagcggggc 


1140 


ay<_t_y y*-ay*- 


tcccagactg 


gagatgctct ggggaaagca 


cttgcttcga 


tctattctcc 


1200 


agatcacact 


aacaacagct 


tttcatcaaa ccctt caact 


cctgttggct 


ctcctccatc 


1260 


t ct c t c a. g c a. 


ggcacagct g 


tttggtctag aaatggagga 


caggcctcat 


cgtctcctaa 


1320 


ttatgaagga 






ga"tcg"t"t"tag 


aaagactgga 


1380 


tgatgctatt 


catgttctcc 


ggaaccatgc ag tgggccca 


tccacagcta 


tgcctggtgg 


1440 


tcatggggac 


atgcatggaa 


tcattggacc ttctcataat 


ggagccatgg 


ataatctaaa 
a L aa L aaa 


1500 


ctcagggtat 


ggaaccggcc 




tcactcatgg 


tggggaccca 


1560 


tcgtgaagat 


99cgtggccc 


tglgagg«g cclttctctt 


ctgccaaacc 


aggttccggt 


1620 


tccacagctt 


cctgtccagt 


ctgcgacttc c cc tgacctg 


aacccacccc 


aggaccctta 


1680 


caga.ggca.tg 


ccaccaggac 




tctggcagct 


ctgagatcaa 


1740 


atccgatgac 


g ^.g 9 9 g^.^ g 


agaacctgca agacacgaaa 


tcttcggagg 


acaagaaatt 


1800 


agatgacgac 


aagaaggata 




aatgacgatg 


aggacctgac 


1860 


acca.ga.gcag 


3- ^9 9 c a g a g c 


gtgagaagga gcggaggatg 


gccaacaatg 


cccgagagcg 


1920 


tctgcgggtc 


cgtgacatca 


acgaggcttt caaagagct c 


ggccgcatgg 


tgcagctcca 


1980 


cctcaagagt 


gacaagcccc 


agaccaagct cctgatcct c 


caccaggcgg 


tggccgtcat 


2040 


cctcagtctg 


gagcagcaag 


tccgagaaag gaat ctgaat 


ccgaaagctg 


cgtgtctgaa 


2100 


aagaagggag 


gaagagaagg 




ctctccttgg 


ccggcccaca 


2160 


ccct ggaatg 




cgaat cacat gggacagatg 


taaaagggtc 


caagttgcca 


2220 


cattgcttca 


ttaaaacaag 


agaccacttc cttaacagct 


gtattatctt 


aaacccacat 


2280 


aaacacttct 


ccttaacccc 




agtctgagta 


gttatgaatc 


2340 


gcagacgcaa 


gaggtttcag 


cattcccaat tatcaaaaaa 


cagaaaaaca 


aaaaaaagaa 


2400 


agaaaaaagt 


gcaacttgag 


ggacgacttt ctttaacata 


tcattcagaa 


tgtgcaaagc 


2460 


agtatgtaca 


ggctgagaca 


cagcccagag actgaacggc 






2500 



<210> 13 

<211> 667 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> TCF-4 

<400> 13



Met His His Gln Gln Arg Met Ala Ala Leu Gly Thr Asp Lys Glu Leu
1 5 10 15

Ser Asp Leu Leu Asp Phe Ser Ala Met Phe Ser Pro Pro Val Ser Ser
20 25 30

Gly Lys Asn Gly Pro Thr Ser Leu Ala Ser Gly His Phe Thr Gly Ser
35 40 45

Asn Val Glu Asp Arg Ser Ser Ser Gly Ser Trp Gly Asn Gly Gly His
50 55 60

Pro Ser Pro Ser Arg Asn Tyr Gly Asp Gly Thr Pro Tyr Asp His Met
65 70 75 80

Thr Ser Arg Asp Leu Gly Ser His Asp Asn Leu Ser Pro Pro Phe Val
85 90 95

Asn Ser Arg Ile Gln Ser Lys Thr Glu Arg Gly Ser Tyr Ser Ser Tyr
100 105 110

Gly Arg Glu Ser Asn Leu Gln Gly Cys His Gln Gln Ser Leu Leu Gly
115 120 125

Gly Asp Met Asp Met Gly Asn Pro Gly Thr Leu Ser Pro Thr Lys Pro
130 135 140

Gly Ser Gln Tyr Tyr Gln Tyr Ser Ser Asn Asn Pro Arg Arg Arg Pro
145 150 155 160

Leu His Ser Ser Ala Met Glu Val Gln Thr Lys Lys Val Arg Lys Val
165 170 175

Pro Pro Gly Leu Pro Ser Ser Val Tyr Ala Pro Ser Ala Ser Thr Ala
180 185 190

Asp Tyr Asn Arg Asp Ser Pro Gly Tyr Pro Ser Ser Lys Pro Ala Thr
195 200 205

Ser Thr Phe Pro Ser Ser Phe Phe Met Gln Asp Gly His His Ser Ser
210 215 220

Asp Pro Trp Ser Ser Ser Ser Gly Met Asn Gln Pro Gly Tyr Ala Gly
225 230 235 240

Met Leu Gly Asn Ser Ser His Ile Pro Gln Ser Ser Ser Tyr Cys Ser
245 250 255

Leu His Pro His Glu Arg Leu Ser Tyr Pro Ser His Ser Ser Ala Asp
260 265 270

Ile Asn Ser Ser Leu Pro Pro Met Ser Thr Phe His Arg Ser Gly Thr
275 280 285
Asn His Tyr Ser Thr Ser Ser Cys Thr Pro Pro Ala Asn Gly Thr Asp
290 295 300

Ser Ile Met Ala Asn Arg Gly Ser Gly Ala Ala Gly Ser Ser Gln Thr
305 310 315 320

Gly Asp Ala Leu Gly Lys Ala Leu Ala Ser Ile Tyr Ser Pro Asp His
325 330 335

Thr Asn Asn Ser Phe Ser Ser Asn Pro Ser Thr Pro Val Gly Ser Pro
340 345 350

Pro Ser Leu Ser Ala Gly Thr Ala Val Trp Ser Arg Asn Gly Gly Gln
355 360 365

Ala Ser Ser Ser Pro Asn Tyr Glu Gly Pro Leu His Ser Leu Gln Ser
370 375 380

Arg Ile Glu Asp Arg Leu Glu Arg Leu Asp Asp Ala Ile His Val Leu
385 390 395 400

Arg Asn His Ala Val Gly Pro Ser Thr Ala Met Pro Gly Gly His Gly
405 410 415

Asp Met His Gly Ile Ile Gly Pro Ser His Asn Gly Ala Met Gly Gly
420 425 430

Leu Gly Ser Gly Tyr Gly Thr Gly Leu Leu Ser Ala Asn Arg His Ser
435 440 445

Leu Met Val Gly Thr His Arg Glu Asp Gly Val Ala Leu Arg Gly Ser
450 455 460

His Ser Leu Leu Pro Asn Gln Val Pro Val Pro Gln Leu Pro Val Gln
465 470 475 480

Ser Ala Thr Ser Pro Asp Leu Asn Pro Pro Gln Asp Pro Tyr Arg Gly
485 490 495

Met Pro Pro Gly Leu Gln Gly Gln Ser Val Ser Ser Gly Ser Ser Glu
500 505 510

Ile Lys Ser Asp Asp Glu Gly Asp Glu Asn Leu Gln Asp Thr Lys Ser
515 520 525

Ser Glu Asp Lys Lys Leu Asp Asp Asp Lys Lys Asp Ile Lys Ser Ile
530 535 540

Thr Ser Asn Asn Asp Asp Glu Asp Leu Thr Pro Glu Gln Lys Ala Glu
545 550 555 560
Arg Glu Lys Glu Arg Arg Met Ala Asn Asn Ala Arg Glu Arg Leu Arg
565 570 575

Val Arg Asp Ile Asn Glu Ala Phe Lys Glu Leu Gly Arg Met Val Gln
580 585 590

Leu His Leu Lys Ser Asp Lys Pro Gln Thr Lys Leu Leu Ile Leu His
595 600 605

Gln Ala Val Ala Val Ile Leu Ser Leu Glu Gln Gln Val Arg Glu Arg
610 615 620

Asn Leu Asn Pro Lys Ala Ala Cys Leu Lys Arg Arg Glu Glu Glu Lys
625 630 635 640

Val Ser Ser Glu Pro Pro Pro Leu Ser Leu Ala Gly Pro His Pro Gly
645 650 655

Met Gly Asp Ala Ser Asn His Met Gly Gln Met
660 665



<210> 14 

<211> 9312 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Notch-1 












<400> 14 
atgccgccgc tcctggcgcc cctgctctgc ctggcgctgc tgcccgcgct cgccgcacga 60

ggcccgcgat gctcccagcc cggtgagacc tgcctgaatg gcgggaagtg tgaagcggcc 120

aatggcacgg aggcctgcgt ctgtggcggg gccttcgtgg gcccgcgatg ccaggacccc 180

aacccgtgcc tcagcacccc ctgcaagaac gccgggacat gccacgtggt ggaccgcaga 240

ggcgtggcag actatgcctg cagctgtgcc ctgggcttct ctgggcccct ctgcctgaca 300

cccctggaca atgcctgcct caccaacccc tgccgcaacg ggggcacctg cgacctgctc 360

acgctgacgg agtacaagtg ccgctgcccg cccggctggt cagggaaatc gtgccagcag 420

gctgacccgt gcgcctccaa cccctgcgcc aacggtggcc agtgcctgcc cttcgaggcc 480

tcctacatct gccactgccc acccagcttc catggcccca cctgccggca ggatgtcaac 540

gagtgtggcc agaagcccgg gctttgccgc cacggaggca cctgccacaa cgaggtcggc 600

tcctaccgct gcgtctgccg cgccacccac actggcccca actgcgagcg gccctacgtg 660

ccctgcagcc cctcgccctg ccagaacggg ggcacctgcc gccccacggg cgacgtcacc 720

cacgagtgtg cctgcctgcc aggcttcacc ggccagaact gtgaggaaaa tatcgacgat 780

tgtccaggaa acaactgcaa gaacgggggt gcctgtgtgg acggcgtgaa cacctacaac 840
aggatgtgga 900

ctgatgccaa atgcctgcca gaacggcggg acctgccaca acacccacgg tggctacaac 960

tgcgtgtgtg tcaacggctg gactggtgag gactgcagcg agaacattga tgactgtgcc 1020

agcgccgcct gcttccacgg cgccacctgc catgaccgtg tggcctcctt ctactgcgag 1080

tgtccccatg gccgcacagg tctgctgtgc cacctcaacg acgcatgcat cagcaacccc 1140

tgtaacgagg gctccaactg cgacaccaac cctgtcaatg gcaaggccat ctgcacctgc 1200

ccctcggggt acacgggccc ggcctgcagc caggacgtgg atgagtgctc gctgggtgcc 1260

aacccctgcg agcatgcggg caagtgcatc aacacgctgg gctccttcga gtgccagtgt 1320

ctgcagggct acacgggccc ccgatgcgag atcgacgtca acgagtgcgt ctcgaacccg 1380

tgccagaacg acgccacctg cctggaccag attggggagt tccagtgcat ctgcatgccc 1440

ggctacgagg gtgtgcactg cgaggtcaac acagacgagt gtgccagcag cccctgcctg 1500

cacaatggcc gctgcctgga caagatcaat gagttccagt gcgagtgccc cacgggcttc 1560

actgggcatc tgtgccagta cgatgtggac gagtgtgcca gcaccccctg caagaatggt 1620

gccaagtgcc tggacggacc caacacttac acctgtgtgt gcacggaagg gtacacgggg 1680

acgcactgcg aggtggacat cgatgagtgc gaccccgacc cctgccacta cggctcctgc 1740

aaggacggcg tcgccacctt cacctgcctc tgccgcccag gctacacggg ccaccactgc 1800

gagaccaaca tcaacgagtg ctccagccag ccctgccgcc acaaaaacac ctgccaggac 1860

cgcgacaacg cctacctctg cttctgcctg aaggggacca caggacccaa ctgcgagatc 1920

aacctggatg actgtgccag cagcccctgc gactcgggca cctgtctgga caagatcgat 1980

ggctacgagt gtgcctgtga gccgggctac acagggagca tgtgtaacat caacatcgat 2040

gagtgtgcgg gcaacccctg ccacaacggg ggcacctgcg aggacggcat caatggcttc 2100

acctgccgct gccccgaggg ctaccacgac cccacctgcc tgtctgaggt caatgagtgc 2160

aacagcaacc cctgcgtcca cggggcctgc cgggacagcc tcaacgggta caagtgcgac 2220

tgtgaccctg ggtggagtgg gaccaactgt gacatcaaca acaatgagtg tgaatccaac 2280

ccttgtgtca acggcggcac ctgcaaagac atgaccagtg gctacgtgtg cacctgccgg 2340

gagggcttca gcggtcccaa ctgccagacc aacatcaacg agtgtgcgtc caacccatgt 2400

ctgaaccagg gcacgtgtat tgacgacgtt gccgggtaca agtgcaactg cctgctgccc 2460

tacacaggtg ccacgtgtga ggtggtgctg gccccgtgtg cccccagccc ctgcagaaac 2520

ggcggggagt gcaggcaatc cgaggactat gagagcttct cctgtgtctg ccccacgggc 2580

tggcaagcag ggcagacctg tgaggtcgac atcaacgagt gcgttctgag cccgtgccgg 2640

cacggcgcat cctgccagaa cacccacggc ggctaccgct gccactgcca ggccggctac 2700

agtgggcgca actgcgagac cgacatcgac gactgccggc ccaacccgtg tcacaacggg 2760

ggctcctgca cagacggcat caacacggcc ttctgcgact gcctgcccgg cttccggggc 2820

actttctgtg aggaggacat caacgagtgt gccagtgacc cctgccgcaa cggggccaac 2880
tgcacggact gcgtggacag ctacacgtgc acctgccccg caggcttcag cgggatccac 2940

tgtgagaaca acacgcctga ctgcacagag agctcctgct tcaacggtgg cacctgcgtg 3000

gacggcatca actcgttcac ctgcctgtgt ccacccggct tcacgggcag ctactgccag 3060

cacgatgtca atgagtgcga ctcacagccc tgcctgcatg gcggcacctg tcaggacggc 3120

tgcggctcct acaggtgcac ctgcccccag ggctacactg gccccaactg ccagaacctt 3180

gtgcactggt gtgactcctc gccctgcaag aacggcggca aatgctggca gacccacacc 3240

cagtaccgct gcgagtgccc cagcggctgg accggccttt actgcgacgt gcccagcgtg 3300

tcctgtgagg tggctgcgca gcgacaaggt gttgacgttg cccgcctgtg ccagcatgga 3360

gggctctgtg tggacgcggg caacacgcac cactgccgct gccaggcggg ctacacaggc 3420

agctactgtg aggacctggt ggacgagtgc tcacccagcc cctgccagaa cggggccacc 3480

tgcacggact acctgggcgg ctactcctgc aagtgcgtgg ccggctacca cggggtgaac 3540

tgctctgagg agatcgacga gtgcctctcc cacccctgcc agaacggggg cacctgcctc 3600

gacctcccca acacctacaa gtgctcctgc ccacggggca ctcagggtgt gcactgtgag 3660

atcaacgtgg acgactgcaa tccccccgtt gaccccgtgt cccggagccc caagtgcttt 3720

aacaacggca cctgcgtgga ccaggtgggc ggctacagct gcacctgccc gccgggcttc 3780

gtgggtgagc gctgtgaggg ggatgtcaac gagtgcctgt ccaatccctg cgacgcccgt 3840

ggcacccaga actgcgtgca gcgcgtcaat gacttccact gcgagtgccg tgctggtcac 3900

accgggcgcc gctgcgagtc cgtcatcaat ggctgcaaag gcaagccctg caagaatggg 3960

ggcacctgcg ccgtggcctc caacaccgcc cgcgggttca tctgcaagtg ccctgcgggc 4020

ttcgagggcg ccacgtgtga gaatgacgct cgtacctgcg gcagcctgcg ctgcctcaac 4080

ggcggcacat gcatctccgg cccgcgcagc cccacctgcc tgtgcctggg ccccttcacg 4140

ggccccgaat gccagttccc ggccagcagc ccctgcctgg gcggcaaccc ctgctacaac 4200

caggggacct gtgagcccac atccgagagc cccttctacc gttgcctgtg ccccgccaaa 4260

ttcaacgggc tcttgtgcca catcctggac tacagcttcg ggggtggggc cgggcgcgac 4320

atccccccgc cgctgatcga ggaggcgtgc gagctgcccg agtgccagga ggacgcgggc 4380

aacaaggtct gcagcctgca gtgcaacaac cacgcgtgcg gctgggacgg cggtgactgc 4440

tccctcaact tcaatgaccc ctggaagaac tgcacgcagt ctctgcagtg ctggaagtac 4500

ttcagtgacg gccactgtga cagccagtgc aactcagccg gctgcctctt cgacggcttt 4560

gactgccagc gtgcggaagg ccagtgcaac cccctgtacg accagtactg caaggaccac 4620

ttcagcgacg ggcactgcga ccagggctgc aacagcgcgg agtgcgagtg ggacgggctg 4680

gactgtgcgg agcatgtacc cgagaggctg gcggccggca cgctggtggt ggtggtgctg 4740

atgccgccgg agcagctgcg caacagctcc ttccacttcc tgcgggagct cagccgcgtg 4800

ctgcacacca acgtggtctt caagcgtgac gcacacggcc agcagatgat cttcccctac 4860

tacggccgcg aggaggagct gcgcaagcac cccatcaagc gtgccgccga gggctgggcc 4920






WO 2005/014854 PCT/EP2004/008819 

gcacctgacg ccctgctggg ccaggtgaag gcctcgctgc tccctggtgg cagcgagggt 4980 

gggcggcggc ggagggagct ggaccccatg gacgtccgcg gctccatcgt ctacctggag 5040 

attgacaacc ggcagtgtgt gcaggcctcc tcgcagtgct tccagagtgc caccgacgtg 5100 

gccgcattcc tgggagcgct cgcctcgctg ggcagcctca acatccccta caagatcgag 5160 

gccgtgcaga gtgagaccgt ggagccgccc ccgccggcgc agctgcactt catgtacgtg 5220 

gcggcggccg cctttgtgct tctgttcttc gtgggctgcg gggtgctgct gtcccgcaag 5280 

cgccggcggc agcatggcca gctctggttc cctgagggct tcaaagtgtc tgaggccagc 5340 

aagaagaagc ggcgggagcc cctcggcgag gactccgtgg gcctcaagcc cctgaagaac 5400 

gcttcagacg gtgccctcat ggacgacaac cagaatgagt ggggggacga ggacctggag 5460 

accaagaagt tccggttcga ggagcccgtg gttctgcctg acctggacga ccagacagac 5520 

caccggcagt ggactcagca gcacctggat gccgctgacc tgcgcatgtc tgccatggcc 5580 

cccacaccgc cccagggtga ggttgacgcc gactgcatgg acgtcaatgt ccgcgggcct 5640 

gatggcttca ccccgctcat gatcgcctcc tgcagcgggg gcggcctgga gacgggcaac 5700 

agcgaggaag aggaggacgc gccggccgtc' atctccgact tcatctacca gggcgccagc 5760 

ctgcacaacc agacagaccg cacgggcgag accgccttgc acctggccgc ccgctactca 5820 

cgctctgatg ccgccaagcg cctgctggag gccagcgcag atgccaacat ccaggacaac 5880 

atgggccgca ccccgctgca tgcggctgtg tctgccgacg cacaaggtgt cttccagatc 5940 

ctgatccgga accgagccac agacctggat gcccgcatgc atgatggcac gacgccactg 6000 

atcctggctg cccgcctggc cgtggagggc atgctggagg acctcatcaa ctcacacgcc 6060 

gacgtcaacg ccgtagatga cctgggcaag tccgccctgc actgggccgc cgccgtgaac 6120 

aatgtggatg ccgcagttgt gctcctgaag aacggggcta acaaagatat gcagaacaac 6180 

agggaggaga cacccctgtt tctggccgcc cgggagggca gctacgagac cgccaaggtg 6240 

ctgctggacc actttgccaa ccgggacatc acggatcata tggaccgcct gccgcgcgac 6300 

atcgcacagg agcgcatgca tcacgacatc gtgaggctgc tggacgagta caacctggtg 6360 

cgcagcccgc agctgcacgg agccccgctg gggggcacgc ccaccctgtc gcccccgctc 6420 

tgctcgccca acggctacct gggcagcctc aagcccggcg tgcagggcaa gaaggtccgc 6480 

aagcccagca gcaaaggcct ggcctgtgga agcaaggagg ccaaggacct caaggcacgg 6540 

aggaagaagt cccaggacgg caagggctgc ctgctggaca gctccggcat gctctcgccc 6600 

gtggactccc tggagtcacc ccatggctac ctgtcagacg tggcctcgcc gccactgctg 6660 

ccctccccgt tccagcagtc tccgtccgtg cccctcaacc acctgcctgg gatgcccgac 6720 

acccacctgg gcatcgggca cctgaacgtg gcggccaagc ccgagatggc ggcgctgggt 6780 

gggggcggcc ggctggcctt tgagactggc ccacctcgtc tctcccacct gcctgtggcc 6840 

tctggcacca gcaccgtcct gggctccagc agcggagggg ccctgaattt cactgtgggc 6900 

gggtccacca gtttgaatgg tcaatgcgag tggctgtccc ggctgcagag cggcatggtg 6960 
ccgaaccaat acaaccctct gcgggggagt gtggcaccag gccccctgag cacacaggcc 7020

ccctccctgc agcatggcat ggtaggcccg ctgcacagta gccttgctgc cagcgccctg 7080

tcccagatga tgagctacca gggcctgccc agcacccggc tggccaccca gcctcacctg 7140

gtgcagaccc agcaggtgca gccacaaaac ttacagatgc agcagcagaa cctgcagcca 7200

gcaaacatcc agcagcagca aagcctgcag ccgccaccac caccaccaca gccgcacctt 7260

ggcgtgagct cagcagccag cggccacctg ggccggagct tcctgagtgg agagccgagc 7320

caggcagacg tgcagccact gggccccagc agcctggcgg tgcacactat tctgccccag 7380

gagagccccg ccctgcccac gtcgctgcca tcctcgctgg tcccacccgt gaccgcagcc 7440

cagttcctga cgcccccctc gcagcacagc tactcctcgc ctgtggacaa cacccccagc 7500

caccagctac aggtgcctga gcaccccttc ctcaccccgt cccctgagtc ccctgaccag 7560

tggtccagct cgtccccgca ttccaacgtc tccgactggt ccgagggcgt ctccagccct 7620

cccaccagca tgcagtccca gatcgcccgc attccggagg ccttcaagta aacggcgcgc 7680

cccacgagac cccggcttcc tttcccaagc cttcgggcgt ctgtgtgcgc tctgtggatg 7740

ccagggccga ccagaggagc ctttttaaaa cacatgtttt tatacaaaat aagaacgagg 7800

attttaattt tttttagtat ttatttatgt acttttattt tacacagaaa cactgccttt 7860

ttatttatat gtactgtttt atctggcccc aggtagaaac ttttatctat tctgagaaaa 7920

caagcaagtt ctgagagcca gggttttcct acgtaggatg aaaagattct tctgtgttta 7980

taaaatataa acaaagattc atgatttata aatgccattt atttattgat tccttttttc 8040

aaaatccaaa aagaaatgat gttggagaag ggaagttgaa cgagcatagt ccaaaaagct 8100

cctggggcgt ccaggccgcg ccctttcccc gacgcccacc caaccccaag ccagcccggc 8160

cgctccacca gcatcacctg cctgttagga gaagctgcat ccagaggcaa acggaggcaa 8220

agctggctca ccttccgcac gcggattaat ttgcatctga aataggaaac aagtgaaagc 8280

atatgggtta gatgttgcca tgtgttttag atggtttctt gcaagcatgc ttgtgaaaat 8340

gtgttctcgg agtgtgtatg ccaagagtgc acccatggta ccaatcatga atctttgttt 8400

caggttcagt attatgtagt tgttcgttgg ttatacaagt tcttggtccc tccagaacca 8460

ccccggcccc ctgcccgttc ttgaaatgta ggcatcatgc atgtcaaaca tgagatgtgt 8520

ggactgtggc acttgcctgg gtcacacacg gaggcatcct acccttttct ggggaaagac 8580

actgcctggg ctgaccccgg tggcggcccc agcacctcag cctgcacagt gtcccccagg 8640


ttccgaagaa 


gatgetccag 


caacacagcc tgggccccag 


ctcgcgggac 


ccgacccccc 


8700 


gtgggctCCC 


gtgttttgta 


ggagacttgc cagagceggg 


cacattgagc 


tgtgcaacgc 


8760 


*-y L yyy L y 


gtcctttggt 


cctgtccccg cagccctggc 


agggggcatg 


eggtegggea 


8820 


ggggctggag 


ggaggcgggg 


gctgcccttg ggccacccct 


cctagtttgg 


gaggagcaga 


8880 


tttttgcaat 


accaagtata 


gectatggea gaaaaaatgt 


ctgtaaatat 


gtttttaaag 


8940 


gtggattttg 


tttaaaaaat 


cttaatgaat gagtctgttg 


tgtgtcatgc 


cagtgaggga 


9000 
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cgtcagactt ggctcagctc ggggagcctt agccgcccat gcactgggga cgctccgctg 9060 

ccgtgccgcc tgcactcctc agggcagcct cccccggctc tacgggggcc gcgtggtgcc 9120 

atccccaggg ggcatgacca gatgcgtccc aagatgttga tttttactgt gttttataaa 9180 

atagagtgta gtttacagaa aaagacttta aaagtgatct acatgaggaa ctgtagatga 9240 

tgtatttttt tcatcttttt tgttaactga tttgcaataa aaatgatact gatggtgaaa 9300 

aaaaaaaaaa aa 9312 

<210> 15 

<211> 2556 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Notch-1 

<400> 15



Met Pro Pro Leu Leu Ala Pro Leu Leu Cys Leu Ala Leu Leu Pro Ala
1 5 10 15

Leu Ala Ala Arg Gly Pro Arg Cys Ser Gln Pro Gly Glu Thr Cys Leu
20 25 30

Asn Gly Gly Lys Cys Glu Ala Ala Asn Gly Thr Glu Ala Cys Val Cys
35 40 45

Gly Gly Ala Phe Val Gly Pro Arg Cys Gln Asp Pro Asn Pro Cys Leu
50 55 60

Ser Thr Pro Cys Lys Asn Ala Gly Thr Cys His Val Val Asp Arg Arg
65 70 75 80

Gly Val Ala Asp Tyr Ala Cys Ser Cys Ala Leu Gly Phe Ser Gly Pro
85 90 95

Leu Cys Leu Thr Pro Leu Asp Asn Ala Cys Leu Thr Asn Pro Cys Arg
100 105 110

Asn Gly Gly Thr Cys Asp Leu Leu Thr Leu Thr Glu Tyr Lys Cys Arg
115 120 125

Cys Pro Pro Gly Trp Ser Gly Lys Ser Cys Gln Gln Ala Asp Pro Cys
130 135 140

Ala Ser Asn Pro Cys Ala Asn Gly Gly Gln Cys Leu Pro Phe Glu Ala
145 150 155 160

Ser Tyr Ile Cys His Cys Pro Pro Ser Phe His Gly Pro Thr Cys Arg
165 170 175

Gln Asp Val Asn Glu Cys Gly Gln Lys Pro Gly Leu Cys Arg His Gly
180 185 190

Gly Thr Cys His Asn Glu Val Gly Ser Tyr Arg Cys Val Cys Arg Ala
195 200 205

Thr His Thr Gly Pro Asn Cys Glu Arg Pro Tyr Val Pro Cys Ser Pro
210 215 220

Ser Pro Cys Gln Asn Gly Gly Thr Cys Arg Pro Thr Gly Asp Val Thr
225 230 235 240

His Glu Cys Ala Cys Leu Pro Gly Phe Thr Gly Gln Asn Cys Glu Glu
245 250 255

Asn Ile Asp Asp Cys Pro Gly Asn Asn Cys Lys Asn Gly Gly Ala Cys
260 265 270

Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys Pro Pro Glu Trp Thr
275 280 285

Gly Gln Tyr Cys Thr Glu Asp Val Asp Glu Cys Gln Leu Met Pro Asn
290 295 300

Ala Cys Gln Asn Gly Gly Thr Cys His Asn Thr His Gly Gly Tyr Asn
305 310 315 320

Cys Val Cys Val Asn Gly Trp Thr Gly Glu Asp Cys Ser Glu Asn Ile
325 330 335

Asp Asp Cys Ala Ser Ala Ala Cys Phe His Gly Ala Thr Cys His Asp
340 345 350

Arg Val Ala Ser Phe Tyr Cys Glu Cys Pro His Gly Arg Thr Gly Leu
355 360 365

Leu Cys His Leu Asn Asp Ala Cys Ile Ser Asn Pro Cys Asn Glu Gly
370 375 380

Ser Asn Cys Asp Thr Asn Pro Val Asn Gly Lys Ala Ile Cys Thr Cys
385 390 395 400

Pro Ser Gly Tyr Thr Gly Pro Ala Cys Ser Gln Asp Val Asp Glu Cys
405 410 415

Ser Leu Gly Ala Asn Pro Cys Glu His Ala Gly Lys Cys Ile Asn Thr
420 425 430

Leu Gly Ser Phe Glu Cys Gln Cys Leu Gln Gly Tyr Thr Gly Pro Arg
435 440 445

Cys Glu Ile Asp Val Asn Glu Cys Val Ser Asn Pro Cys Gln Asn Asp
450 455 460

Ala Thr Cys Leu Asp Gln Ile Gly Glu Phe Gln Cys Ile Cys Met Pro
465 470 475 480

Gly Tyr Glu Gly Val His Cys Glu Val Asn Thr Asp Glu Cys Ala Ser
485 490 495

Ser Pro Cys Leu His Asn Gly Arg Cys Leu Asp Lys Ile Asn Glu Phe
500 505 510

Gln Cys Glu Cys Pro Thr Gly Phe Thr Gly His Leu Cys Gln Tyr Asp
515 520 525

Val Asp Glu Cys Ala Ser Thr Pro Cys Lys Asn Gly Ala Lys Cys Leu
530 535 540

Asp Gly Pro Asn Thr Tyr Thr Cys Val Cys Thr Glu Gly Tyr Thr Gly
545 550 555 560

Thr His Cys Glu Val Asp Ile Asp Glu Cys Asp Pro Asp Pro Cys His
565 570 575

Tyr Gly Ser Cys Lys Asp Gly Val Ala Thr Phe Thr Cys Leu Cys Arg
580 585 590

Pro Gly Tyr Thr Gly His His Cys Glu Thr Asn Ile Asn Glu Cys Ser
595 600 605

Ser Gln Pro Cys Arg His Gly Gly Thr Cys Gln Asp Arg Asp Asn Ala
610 615 620

Tyr Leu Cys Phe Cys Leu Lys Gly Thr Thr Gly Pro Asn Cys Glu Ile
625 630 635 640

Asn Leu Asp Asp Cys Ala Ser Ser Pro Cys Asp Ser Gly Thr Cys Leu
645 650 655

Asp Lys Ile Asp Gly Tyr Glu Cys Ala Cys Glu Pro Gly Tyr Thr Gly
660 665 670

Ser Met Cys Asn Ile Asn Ile Asp Glu Cys Ala Gly Asn Pro Cys His
675 680 685

Asn Gly Gly Thr Cys Glu Asp Gly Ile Asn Gly Phe Thr Cys Arg Cys
690 695 700

Pro Glu Gly Tyr His Asp Pro Thr Cys Leu Ser Glu Val Asn Glu Cys
705 710 715 720

Asn Ser Asn Pro Cys Val His Gly Ala Cys Arg Asp Ser Leu Asn Gly
725 730 735

Tyr Lys Cys Asp Cys Asp Pro Gly Trp Ser Gly Thr Asn Cys Asp Ile
740 745 750

Asn Asn Asn Glu Cys Glu Ser Asn Pro Cys Val Asn Gly Gly Thr Cys
755 760 765

Lys Asp Met Thr Ser Gly Tyr Val Cys Thr Cys Arg Glu Gly Phe Ser
770 775 780

Gly Pro Asn Cys Gln Thr Asn Ile Asn Glu Cys Ala Ser Asn Pro Cys
785 790 795 800

Leu Asn Gln Gly Thr Cys Ile Asp Asp Val Ala Gly Tyr Lys Cys Asn
805 810 815

Cys Leu Leu Pro Tyr Thr Gly Ala Thr Cys Glu Val Val Leu Ala Pro
820 825 830

Cys Ala Pro Ser Pro Cys Arg Asn Gly Gly Glu Cys Arg Gln Ser Glu
835 840 845

Asp Tyr Glu Ser Phe Ser Cys Val Cys Pro Thr Gly Trp Gln Ala Gly
850 855 860

Gln Thr Cys Glu Val Asp Ile Asn Glu Cys Val Leu Ser Pro Cys Arg
865 870 875 880

His Gly Ala Ser Cys Gln Asn Thr His Gly Gly Tyr Arg Cys His Cys
885 890 895

Gln Ala Gly Tyr Ser Gly Arg Asn Cys Glu Thr Asp Ile Asp Asp Cys
900 905 910

Arg Pro Asn Pro Cys His Asn Gly Gly Ser Cys Thr Asp Gly Ile Asn
915 920 925

Thr Ala Phe Cys Asp Cys Leu Pro Gly Phe Arg Gly Thr Phe Cys Glu
930 935 940

Glu Asp Ile Asn Glu Cys Ala Ser Asp Pro Cys Arg Asn Gly Ala Asn
945 950 955 960

Cys Thr Asp Cys Val Asp Ser Tyr Thr Cys Thr Cys Pro Ala Gly Phe
965 970 975

Ser Gly Ile His Cys Glu Asn Asn Thr Pro Asp Cys Thr Glu Ser Ser
980 985 990

Cys Phe Asn Gly Gly Thr Cys Val Asp Gly Ile Asn Ser Phe Thr Cys
995 1000 1005

Leu Cys Pro Pro Gly Phe Thr Gly Ser Tyr Cys Gln His Asp Val
1010 1015 1020

Asn Glu Cys Asp Ser Gln Pro Cys Leu His Gly Gly Thr Cys Gln
1025 1030 1035

Asp Gly Cys Gly Ser Tyr Arg Cys Thr Cys Pro Gln Gly Tyr Thr
1040 1045 1050

Gly Pro Asn Cys Gln Asn Leu Val His Trp Cys Asp Ser Ser Pro
1055 1060 1065

Cys Lys Asn Gly Gly Lys Cys Trp Gln Thr His Thr Gln Tyr Arg
1070 1075 1080

Cys Glu Cys Pro Ser Gly Trp Thr Gly Leu Tyr Cys Asp Val Pro
1085 1090 1095

Ser Val Ser Cys Glu Val Ala Ala Gln Arg Gln Gly Val Asp Val
1100 1105 1110

Ala Arg Leu Cys Gln His Gly Gly Leu Cys Val Asp Ala Gly Asn
1115 1120 1125

Thr His His Cys Arg Cys Gln Ala Gly Tyr Thr Gly Ser Tyr Cys
1130 1135 1140

Glu Asp Leu Val Asp Glu Cys Ser Pro Ser Pro Cys Gln Asn Gly
1145 1150 1155

Ala Thr Cys Thr Asp Tyr Leu Gly Gly Tyr Ser Cys Lys Cys Val
1160 1165 1170

Ala Gly Tyr His Gly Val Asn Cys Ser Glu Glu Ile Asp Glu Cys
1175 1180 1185

Leu Ser His Pro Cys Gln Asn Gly Gly Thr Cys Leu Asp Leu Pro
1190 1195 1200

Asn Thr Tyr Lys Cys Ser Cys Pro Arg Gly Thr Gln Gly Val His
1205 1210 1215

Cys Glu Ile Asn Val Asp Asp Cys Asn Pro Pro Val Asp Pro Val
1220 1225 1230

Ser Arg Ser Pro Lys Cys Phe Asn Asn Gly Thr Cys Val Asp Gln
1235 1240 1245

Val Gly Gly Tyr Ser Cys Thr Cys Pro Pro Gly Phe Val Gly Glu
1250 1255 1260

Arg Cys Glu Gly Asp Val Asn Glu Cys Leu Ser Asn Pro Cys Asp
1265 1270 1275

Ala Arg Gly Thr Gln Asn Cys Val Gln Arg Val Asn Asp Phe His
1280 1285 1290

Cys Glu Cys Arg Ala Gly His Thr Gly Arg Arg Cys Glu Ser Val
1295 1300 1305

Ile Asn Gly Cys Lys Gly Lys Pro Cys Lys Asn Gly Gly Thr Cys
1310 1315 1320

Ala Val Ala Ser Asn Thr Ala Arg Gly Phe Ile Cys Lys Cys Pro
1325 1330 1335

Ala Gly Phe Glu Gly Ala Thr Cys Glu Asn Asp Ala Arg Thr Cys
1340 1345 1350

Gly Ser Leu Arg Cys Leu Asn Gly Gly Thr Cys Ile Ser Gly Pro
1355 1360 1365

Arg Ser Pro Thr Cys Leu Cys Leu Gly Pro Phe Thr Gly Pro Glu
1370 1375 1380

Cys Gln Phe Pro Ala Ser Ser Pro Cys Leu Gly Gly Asn Pro Cys
1385 1390 1395

Tyr Asn Gln Gly Thr Cys Glu Pro Thr Ser Glu Ser Pro Phe Tyr
1400 1405 1410

Arg Cys Leu Cys Pro Ala Lys Phe Asn Gly Leu Leu Cys His Ile
1415 1420 1425

Leu Asp Tyr Ser Phe Gly Gly Gly Ala Gly Arg Asp Ile Pro Pro
1430 1435 1440

Pro Leu Ile Glu Glu Ala Cys Glu Leu Pro Glu Cys Gln Glu Asp
1445 1450 1455

Ala Gly Asn Lys Val Cys Ser Leu Gln Cys Asn Asn His Ala Cys
1460 1465 1470

Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp Pro Trp
1475 1480 1485

Lys Asn Cys Thr Gln Ser Leu Gln Cys Trp Lys Tyr Phe Ser Asp
1490 1495 1500

Gly His Cys Asp Ser Gln Cys Asn Ser Ala Gly Cys Leu Phe Asp
1505 1510 1515

Gly Phe Asp Cys Gln Arg Ala Glu Gly Gln Cys Asn Pro Leu Tyr
1520 1525 1530

Asp Gln Tyr Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gln
1535 1540 1545

Gly Cys Asn Ser Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala
1550 1555 1560

Glu His Val Pro Glu Arg Leu Ala Ala Gly Thr Leu Val Val Val
1565 1570 1575

Val Leu Met Pro Pro Glu Gln Leu Arg Asn Ser Ser Phe His Phe
1580 1585 1590

Leu Arg Glu Leu Ser Arg Val Leu His Thr Asn Val Val Phe Lys
1595 1600 1605

Arg Asp Ala His Gly Gln Gln Met Ile Phe Pro Tyr Tyr Gly Arg
1610 1615 1620

Glu Glu Glu Leu Arg Lys His Pro Ile Lys Arg Ala Ala Glu Gly
1625 1630 1635

Trp Ala Ala Pro Asp Ala Leu Leu Gly Gln Val Lys Ala Ser Leu
1640 1645 1650

Leu Pro Gly Gly Ser Glu Gly Gly Arg Arg Arg Arg Glu Leu Asp
1655 1660 1665

Pro Met Asp Val Arg Gly Ser Ile Val Tyr Leu Glu Ile Asp Asn
1670 1675 1680

Arg Gln Cys Val Gln Ala Ser Ser Gln Cys Phe Gln Ser Ala Thr
1685 1690 1695

Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu Gly Ser Leu
1700 1705 1710

Asn Ile Pro Tyr Lys Ile Glu Ala Val Gln Ser Glu Thr Val Glu
1715 1720 1725

Pro Pro Pro Pro Ala Gln Leu His Phe Met Tyr Val Ala Ala Ala
1730 1735 1740

Ala Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser
1745 1750 1755

Arg Lys Arg Arg Arg Gln His Gly Gln Leu Trp Phe Pro Glu Gly
1760 1765 1770

Phe Lys Val Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Pro Leu
1775 1780 1785

Gly Glu Asp Ser Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp
1790 1795 1800

Gly Ala Leu Met Asp Asp Asn Gln Asn Glu Trp Gly Asp Glu Asp
1805 1810 1815

Leu Glu Thr Lys Lys Phe Arg Phe Glu Glu Pro Val Val Leu Pro
1820 1825 1830

Asp Leu Asp Asp Gln Thr Asp His Arg Gln Trp Thr Gln Gln His
1835 1840 1845

Leu Asp Ala Ala Asp Leu Arg Met Ser Ala Met Ala Pro Thr Pro
1850 1855 1860

Pro Gln Gly Glu Val Asp Ala Asp Cys Met Asp Val Asn Val Arg
1865 1870 1875

Gly Pro Asp Gly Phe Thr Pro Leu Met Ile Ala Ser Cys Ser Gly
1880 1885 1890

Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Pro
1895 1900 1905

Ala Val Ile Ser Asp Phe Ile Tyr Gln Gly Ala Ser Leu His Asn
1910 1915 1920

Gln Thr Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg
1925 1930 1935

Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu Ala Ser Ala
1940 1945 1950

Asp Ala Asn Ile Gln Asp Asn Met Gly Arg Thr Pro Leu His Ala
1955 1960 1965

Ala Val Ser Ala Asp Ala Gln Gly Val Phe Gln Ile Leu Ile Arg
1970 1975 1980

Asn Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr
1985 1990 1995

Pro Leu Ile Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu
2000 2005 2010

Asp Leu Ile Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu
2015 2020 2025

Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val Asp
2030 2035 2040

Ala Ala Val Val Leu Leu Lys Asn Gly Ala Asn Lys Asp Met Gln
2045 2050 2055

Asn Asn Arg Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly
2060 2065 2070

Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His Phe Ala Asn Arg
2075 2080 2085

Asp Ile Thr Asp His Met Asp Arg Leu Pro Arg Asp Ile Ala Gln
2090 2095 2100

Glu Arg Met His His Asp Ile Val Arg Leu Leu Asp Glu Tyr Asn
2105 2110 2115

Leu Val Arg Ser Pro Gln Leu His Gly Ala Pro Leu Gly Gly Thr
2120 2125 2130

Pro Thr Leu Ser Pro Pro Leu Cys Ser Pro Asn Gly Tyr Leu Gly
2135 2140 2145

Ser Leu Lys Pro Gly Val Gln Gly Lys Lys Val Arg Lys Pro Ser
2150 2155 2160

Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala Lys Asp Leu Lys
2165 2170 2175

Ala Arg Arg Lys Lys Ser Gln Asp Gly Lys Gly Cys Leu Leu Asp
2180 2185 2190

Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His
2195 2200 2205

Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser Pro
2210 2215 2220

Phe Gln Gln Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met
2225 2230 2235

Pro Asp Thr His Leu Gly Ile Gly His Leu Asn Val Ala Ala Lys
2240 2245 2250

Pro Glu Met Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu
2255 2260 2265

Thr Gly Pro Pro Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr
2270 2275 2280

Ser Thr Val Leu Gly Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr
2285 2290 2295

Val Gly Gly Ser Thr Ser Leu Asn Gly Gln Cys Glu Trp Leu Ser
2300 2305 2310

Arg Leu Gln Ser Gly Met Val Pro Asn Gln Tyr Asn Pro Leu Arg
2315 2320 2325

Gly Ser Val Ala Pro Gly Pro Leu Ser Thr Gln Ala Pro Ser Leu
2330 2335 2340

Gln His Gly Met Val Gly Pro Leu His Ser Ser Leu Ala Ala Ser
2345 2350 2355

Ala Leu Ser Gln Met Met Ser Tyr Gln Gly Leu Pro Ser Thr Arg
2360 2365 2370

Leu Ala Thr Gln Pro His Leu Val Gln Thr Gln Gln Val Gln Pro
2375 2380 2385

Gln Asn Leu Gln Met Gln Gln Gln Asn Leu Gln Pro Ala Asn Ile
2390 2395 2400

Gln Gln Gln Gln Ser Leu Gln Pro Pro Pro Pro Pro Pro Gln Pro
2405 2410 2415

His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu Gly Arg Ser
2420 2425 2430

Phe Leu Ser Gly Glu Pro Ser Gln Ala Asp Val Gln Pro Leu Gly
2435 2440 2445

Pro Ser Ser Leu Ala Val His Thr Ile Leu Pro Gln Glu Ser Pro
2450 2455 2460

Ala Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr
2465 2470 2475

Ala Ala Gln Phe Leu Thr Pro Pro Ser Gln His Ser Tyr Ser Ser
2480 2485 2490

Pro Val Asp Asn Thr Pro Ser His Gln Leu Gln Val Pro Glu His
2495 2500 2505

Pro Phe Leu Thr Pro Ser Pro Glu Ser Pro Asp Gln Trp Ser Ser
2510 2515 2520

Ser Ser Pro His Ser Asn Val Ser Asp Trp Ser Glu Gly Val Ser
2525 2530 2535

Ser Pro Pro Thr Ser Met Gln Ser Gln Ile Ala Arg Ile Pro Glu
2540 2545 2550



<210> 16 

<211> 11433 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Notch-2 












<400> 16 
aggctgcttc gttgcacacc cgagaaagtt tcagccaaac ttcgggcggc ggctgaggcg 60

gcggccgagg agcggcggac tcggggcgcg gggagtcgag gcatttgcgc ctgggcttcg 120

gagcgtagcg ccagggcctg agcctttgaa gcaggaggag gggaggagag agtggggctc 180

ctctatcggg accccctccc catgtggatc tgcccaggcg gcggcggcgg cggcggagga 240

ggaggcgacc gagaagatgc ccgccctgcg ccccgctctg ctgtgggcgc tgctggcgct 300

ctggctgtgc tgcgcggccc ccgcgcatgc attgcagtgt cgagatggct atgaaccctg 360

tgtaaatgaa ggaatgtgtg ttacctacca caatggcaca ggatactgca aatgtccaga 420

aggcttcttg ggggaatatt gtcaacatcg agacccctgt gagaagaacc gctgccagaa 480

tggtgggact tgtgtggccc aggccatgct ggggaaagcc acgtgccgat gtgcctcagg 540

gtttacagga gaggactgcc agtactcaac atctcatcca tgctttgtgt ctcgaccctg 600

cctgaatggc ggcacatgcc atatgctcag ccgggatacc tatgagtgca cctgtcaagt 660

cgggtttaca ggtaaggagt gccaatggac ggatgcctgc ctgtctcatc cctgtgcaaa 720

tggaagtacc tgtaccactg tggccaacca gttctcctgc aaatgcctca caggcttcac 780

agggcagaaa tgtgagactg atgtcaatga gtgtgacatt ccaggacact gccagcatgg 840

tggcacctgc ctcaacctgc ctggttccta ccagtgccag tgccctcagg gcttcacagg 900

ccagtactgt gacagcctgt atgtgccctg tgcaccctca ccttgtgtca atggaggcac 960

ctgtcggcag actggtgact tcacttttga gtgcaactgc cttccaggtt ttgaagggag 1020

cacctgtgag aggaatattg atgactgccc taaccacagg tgtcagaatg gaggggtttg 1080

tgtggatggg gtcaacactt acaactgccg ctgtccccca caatggacag gacagttctg 1140

cacagaggat gtggatgaat gcctgctgca gcccaatgcc tgtcaaaatg ggggcacctg 1200

tgccaaccgc aatggaggct atggctgtgt atgtgtcaac ggctggagtg gagatgactg 1260

cagtgagaac attgatgatt gtgccttcgc ctcctgtact ccaggctcca cctgcatcga 1320

ccgtgtggcc tccttctctt gcatgtgccc agaggggaag gcaggtctcc tgtgtcatct 1380

ggatgatgca tgcatcagca atccttgcca caagggggca ctgtgtgaca ccaaccccct 1440 

aaatgggcaa tatatttgca cctgcccaca aggctacaaa ggggctgact gcacagaaga 1500 

tgtggatgaa tgtgccatgg ccaatagcaa tccttgtgag catgcaggaa aatgtgtgaa 1560 

cacggatggc gccttccact gtgagtgtct gaagggttat gcaggacctc gttgtgagat 1620 

ggacatcaat gagtgccatt cagacccctg ccagaatgat gctacctgtc tggataagat 1680 

tggaggcttc acatgtctgt gcatgccagg tttcaaaggt gtgcattgtg aattagaaat 1740 

aaatgaatgt cagagcaacc cttgtgtgaa caatgggcag tgtgtggata aagtcaatcg 1800 

tttccagtgc ctgtgtcctc ctggtttcac tgggccagtt tgccagattg atattgatga 1860 

ctgttccagt actccgtgtc tgaatggggc aaagtgtatc gatcacccga atggctatga 1920 

atgccagtgt gccacaggtt tcactggtgt gttgtgtgag gagaacattg acaactgtga 1980 

ccccgatcct tgccaccatg gtcagtgtca ggatggtatt gattcctaca cctgcatctg 2040 

caatcccggg tacatgggcg ccatctgcag tgaccagatt gatgaatgtt acagcagccc 2100 

ttgcctgaac gatggtcgct gcattgacct ggtcaatggc taccagtgca actgccagcc 2160 

aggcacgtca ggggttaatt gtgaaattaa ttttgatgac tgtgcaagta acccttgtat 2220 

ccatggaatc tgtatggatg gcattaatcg ctacagttgt gtctgctcac caggattcac 2280 

agggcagaga tgtaacattg acattgatga gtgtgcctcc aatccctgtc gcaagggtgc 2340 

aacatgtatc aacggtgtga atggtttccg ctgtatatgc cccgagggac cccatcaccc 2400 

cagctgctac tcacaggtga acgaatgcct gagcaatccc tgcatccatg gaaactgtac 2460 

tggaggtctc agtggatata agtgtctctg tgatgcaggc tgggttggca tcaactgtga 2520 

agtggacaaa aatgaatgcc tttcgaatcc atgccagaat ggaggaactt gtgacaatct 2580 

ggtgaatgga tacaggtgta cttgcaagaa gggctttaaa ggctataact gccaggtgaa 2640 

tattgatgaa tgtgcctcaa atccatgcct gaaccaagga acctgctttg atgacataag 2700 

tggctacact tgccactgtg tgctgccata cacaggcaag aattgtcaga cagtattggc 2760 

tccctgttcc ccaaaccctt gtgagaatgc tgctgtttgc aaagagtcac caaattttga 2820 

gagttatact tgcttgtgtg ctcctggctg gcaaggtcag cggtgtacca ttgacattga 2880 

cgagtgtatc tccaagccct gcatgaacca tggtctctgc cataacaccc agggcagcta 2940 

catgtgtgaa tgtccaccag gcttcagtgg tatggactgt gaggaggaca ttgatgactg 3000 

ccttgccaat ccttgccaga atggaggttc ctgtatggat ggagtgaata ctttctcctg 3060 

cctctgcctt ccgggtttca ctggggataa gtgccagaca gacatgaatg agtgtctgag 3120 

tgaaccctgt aagaatggag ggacctgctc tgactacgtc aacagttaca cttgcaagtg 3180 

ccaggcagga tttgatggag tccattgtga gaacaacatc aatgagtgca ctgagagctc 3240 

ctgtttcaat ggtggcacat gtgttgatgg gattaactcc ttctcttgct tgtgccctgt 3300 

gggtttcact ggatccttct gcctccatga gatcaatgaa tgcagctctc atccatgcct 3360

[illegible] [illegible] [illegible] tacctaccgc tgcagctgcc ccctgggcta 3420

[illegible] aactgtcaga ccctggtgaa tctctgcagt cggtctccat gtaaaaacaa 3480

aggtacttgc gttcagaaaa aagcagagtc ccagtgccta tgtccatctg gatgggctgg 3540

[illegible] gacgtgccca atgtctcttg tgacatagca gcctccagga [illegible] 3600

[illegible] ttgtgccagc actcaggtgt ctgcatcaat gctggcaaca [illegible] 3660

Lcagtgcccc [illegible] ctgggagcta ctgtgaggag caactcgatg agtgtgcgtc 3720

caacccctgc [illegible] caacatgcag tgacttcatt ggtggataca gatgcgagtg 3780

tgtcccaggc [illegible] tcaactgtga gtatgaagtg gatgagtgcc agaatcagcc 3840

[illegible] [illegible] gtattgacct tgtgaaccat ttcaagtgct cttgcccacc 3900

[illegible] ggcctactct gtgaagagaa cattgatgac tgtgcccggg gtccccattg 3960

[illegible] ggtcagtgca tggataggat tggaggctac agttgtcgct gcttgcctgg 4020

ctttgctggg gagcgttgtg agggagacat caacgagtgc ctctccaacc cctgcagctc 4080

[illegible] ctggactgta tacagctcac caatgactac ctgtgtgttt gccgtagtgc 4140

[illegible] [illegible] aaaccttcgt cgatgtgtgt ccccagatgc cctgcctgaa 4200

[illegible] [illegible] ccagtaacat gcctgatggt ttcatttgcc gttgtccccc 4260

gggattttcc [illegible] gccagagcag ctgtggacaa gtgaaatgta ggaaggggga 4320

[illegible] cacaccgcct ctggaccccg ctgcttctgc cccagtcccc gggactgcga 4380

gtcaggctgt gccagtagcc cctgccagca cgggggcagc tgccaccctc agcgccagcc 4440

[illegible] tcctgccagt gtgccccacc attctcgggt agccgctgtg aactctacac 4500

ggcacccccc agcacccctc ctgccacctg tctgagccag tattgtgccg acaaagctcg 4560

[illegible] [illegible] cctgcaacag ccatgcctgc cagtgggatg [illegible] 4620

[illegible] a^ggagaacc cctgggccaa ctgctcctcc ccacttccct gctgggatta 4680

tatcaacaac [illegible] agctgtgcaa cacggtcgag tgcctgtttg acaactttga 4740

atgccagggg aacagcaaga catgcaagta tgacaaatac tgtgcagacc acttcaaaga 4800

caaccactgt gaccaggggt gcaacagtga ggagtgtggt [illegible] tggactgtgc 4860

tgctgaccaa [illegible] tggcagaagg taccctggtt attgtggtat tgatgccacc 4920

[illegible] ctxcaggatg ctcgcagctt cttgcgggca ctgggtaccc tgctccacac 4980

caacctgcgc attaagcggg actcccaggg ggaactcatg gtgtacccct attatggtga 5040

gaagtxagct gctatgaaga aacagaggat gacacgcaga tcccttcctg gtgaacaaga 5100

acaggaggtg gctggctcta aagtctttct ggaaattgac aaccgccagt gtgttcaaga 5160

ctcagaccac tgcttcaaga acacggatgc agcagcagct ctcctggcct ctcacgccat 5220

acaggggacc ctgtcatacc ctcttgtgtc tgtcgtcagt gaatccctga ctccagaacg 5280

cactcagctc ctctatctcc ttgctgttgc tgttgtcatc attctgttta ttattctgct 5340

gggggtaatc atggcaaaac gaaagcgtaa gcatggctct ctctggctgc ctgaaggttt 5400

cactcttcgc cgagatgcaa gcaatcacaa gcgtcgtgag ccagtgggac aggatgctgt 5460

ggggctgaaa aatctctcag tgcaagtctc agaagctaac ctaattggta ctggaacaag 5520

tgaacactgg gtcgatgatg aagggcccca gccaaagaaa gtaaaggctg aagatgaggc 5580 

cttactctca gaagaagatg accccattga tcgacggcca tggacacagc agcaccttga 5640 

agctgcagac atccgtagga caccatcgct ggctctcacc cctcctcagg cagagcagga 5700 

ggtggatgtg ttagatgtga atgtccgtgg cccagatggc tgcaccccat tgatgttggc 5760 

ttctctccga ggaggcagct cagatttgag tgatgaagat gaagatgcag aggactcttc 5820 

tgctaacatc atcacagact tggtctacca gggtgccagc ctccaggccc agacagaccg 5880 

gactggtgag atggccctgc accttgcagc ccgctactca cgggctgatg ctgccaagcg 5940 

tctcctggat gcaggtgcag atgccaatgc ccaggacaac atgggccgct gtccactcca 6000 

tgctgcagtg gcagctgatg cccaaggtgt cttccagatt ctgattcgca accgagtaac 6060 

tgatctagat gccaggatga atgatggtac tacacccctg atcctggctg cccgcctggc 6120 

tgtggaggga atggtggcag aactgatcaa ctgccaagcg gatgtgaatg cagtggatga 6180 

ccatggaaaa tctgctcttc actgggcagc tgctgtcaat aatgtggagg caactctttt 6240 

gttgttgaaa aatggggcca accgagacat gcaggacaac aaggaagaga cacctctgtt 6300 

tcttgctgcc cgggagggga gctatgaagc agccaagatc ctgttagacc attttgccaa 6360 

tcgagacatc acagaccata tggatcgtct tccccgggat gtggctcggg atcgcatgca 6420 

ccatgacatt gtgcgccttc tggatgaata caatgtgacc ccaagccctc caggcaccgt 6480 

gttgacttct gctctctcac ctgtcatctg tgggcccaac agatctttcc tcagcctgaa 6540 

gcacacccca atgggcaaga agtctagacg gcccagtgcc aagagtacca tgcctactag 6600 

cctccctaac cttgccaagg aggcaaagga tgccaagggt agtaggagga agaagtctct 6660 

gagtgagaag gtccaactgt ctgagagttc agtaacttta tcccctgttg attccctaga 6720 

atctcctcac acgtatgttt ccgacaccac atcctctcca atgattacat cccctgggat 6780 

cttacaggcc tcacccaacc ctatgttggc cactgccgcc cctcctgccc cagtccatgc 6840 

ccagcatgca ctatcttttt ctaaccttca tgaaatgcag cctttggcac atggggccag 6900 

cactgtgctt ccctcagtga gccagttgct atcccaccac cacattgtgt ctccaggcag 6960 

tggcagtgct ggaagcttga gtaggctcca tccagtccca gtcccagcag attggatgaa 7020 

ccgcatggag gtgaatgaga cccagtacaa tgagatgttt ggtatggtcc tggctccagc 7080 

tgagggcacc catcctggca tagctcccca gagcaggcca cctgaaggga agcacataac 7140 

cacccctcgg gagcccttgc cccccattgt gactttccag ctcatcccta aaggcagtat 7200 

tgcccaacca gcgggggctc cccagcctca gtccacctgc cctccagctg ttgcgggccc 7260 

cctgcccacc atgtaccaga ttccagaaat ggcccgtttg cccagtgtgg ctttccccac 7320 

tgccatgatg ccccagcagg acgggcaggt agctcagacc attctcccag cctatcatcc 7380 

tttcccagcc tctgtgggca agtaccccac acccccttca cagcacagtt atgcttcctc 7440

aaatgctgct gagcgaacac ccagtcacag tggtcacctc cagggtgagc atccctacct 7500

gacaccatcc ccagagtctc ctgaccagtg gtcaagttca tcaccccact ctgcttctga 7560

ctggtcagat gtgaccacca gccctacccc tgggggtgct ggaggaggtc agcggggacc 7620

tgggacacac atgtctgagc caccacacaa caacatgcag gtttatgcgt gagagagtcc 7680

acctccagtg tagagacata actgactttt gtaaatgctg ctgaggaaca aatgaaggtc 7740

atccgggaga gaaatgaaga aatctctgga gccagcttct agaggtagga aagagaagat 7800

gttcttattc agataatgca agagaagcaa ttcgtcagtt tcactgggta tctgcaaggc 7860

ttattgatta ttctaatcta ataagacaag tttgtggaaa tgcaagatga atacaagcct 7920

tgggtccatg tttactctct tctatttgga gaataagatg gatgcttatt gaagcccaga 7980

cattcttgca gcttggactg cattttaagc cctgcaggct tctgccatat ccatgagaag 8040

attctacact agcgtcctgt tgggaattat gccctggaat tctgcctgaa ttgacctacg 8100

catctcctcc tccttggaca ttcttttgtc ttcatttggt gcttttggtt ttgcacctct 8160

ccgtgattgt agccctacca gcatgttata gggcaagacc tttgtgcttt tgatcattct 8220

ggcccatgaa agcaactttg gtctcctttc ccctcctgtc ttcccggtat cccttggagt 8280

ctcacaaggt ttactttggt atggttctca gcacaaacct ttcaagtatg ttgtttcttt 8340

ggaaaatgga catactgtat tgtgttctcc tgcatatatc attcctggag agagaagggg 8400

agaagaatac ttttcttcaa caaattttgg gggcaggaga tcccttcaag aggctgcacc 8460

ttaatttttc ttgtctgtgt gcaggtcttc atataaactt taccaggaag aagggtgtga 8520

gtttgttgtt tttctgtgta tgggcctggt cagtgtaaag ttttatcctt gatagtctag 8580

ttactatgac cctccccact tttttaaaac cagaaaaagg tttggaatgt tggaatgacc 8640

aagagacaag ttaactcgtg caagagccag ttacccaccc acaggtcccc ctacttcctg 8700

ccaagcattc cattgactgc ctgtatggaa cacatttgtc ccagatctga gcattctagg 8760

cctgtttcac tcactcaccc agcatatgaa actagtctta actgttgagc ctttcctttc 8820

atatccacag aagacactgt ctcaaatgtt gtacccttgc catttaggac tgaactttcc 8880

ttagcccaag ggacccagtg acagttgtct tccgtttgtc agatgatcag tctctactga 8940

ttatcttgct gcttaaaggc ctgctcacca atctttcttt cacaccgtgt ggtccgtgtt 9000

actggtatac ccagtatgtt ctcactgaag acatggactt tatatgttca agtgcaggaa 9060

ttggaaagtt ggacttgttt tctatgatcc aaaacagccc tataagaagg ttggaaaagg 9120

aggaactata tagcagcctt tgctattttc tgctaccatt tcttttcctc tgaagcggcc 9180

atgacattcc ctttggcaac taacgtagaa actcaacaga acattttcct ttcctagagt 9240

caccttttag atgataatgg acaactatag acttgctcat tgttcagact gattgcccct 9300

cacctgaatc cactctctgt attcatgctc ttggcaattt ctttgacttt cttttaaggg 9360

cagaagcatt ttagttaatt gtagataaag aatagttttc ttcctcttct ccttgggcca 9420

gttaataatt ggtccatggc tacactgcaa cttccgtcca gtgctgtgat gcccatgaca 9480

cctgcaaaat aagttctgcc tgggcatttt gtagatatta acaggtgaat tcccgactct 9540

tttggtttga atgacagttc tcattccttc tatggctgca agtatgcatc agtgcttccc 9600 

acttacctga tttgtctgtc ggtggcccca tatggaaacc ctgcgtgtct gttggcataa 9660 

tagtttacaa atggtttttt cagtcctatc caaatttatt gaaccaacaa aaataattac 9720 

ttctgccctg agataagcag attaagtttg ttcattctct gctttattct ctccatgtgg 9780 

caacattctg tcagcctctt tcatagtgtg caaacatttt atcattctaa atggtgactc 9840 

tctgcccttg gacccattta ttattcacag atggggagaa cctatctgca tggacctctg 9900 

tggaccacag cgtacctgcc cctttctgcc ctcctgctcc agccccactt ctgaaagtat 9960 

cagctactga tccagccact ggatatttta tatcctccct tttccttaag cacaatgtca 10020 

gaccaaattg cttgtttctt tttcttggac tactttaatt tggatccttt gggtttggag 10080 

aaagggaatg tgaaagctgt cattacagac aacaggtttc agtgatgagg aggacaacac 10140 

tgcctttcaa actttttact gatctcttag attttaagaa ctcttgaatt gtgtggtatc 10200 

taataaaagg gaaggtaaga tggataatca ctttctcatt tgggttctga attggagact 10260 

cagtttttat gagacacatc ttttatgcca tgtatagatc ctcccctgct atttttggtt 10320 

tatttttatt gttataaatg ctttctttct ttgactcctc ttctgcctgc ctttggggat 10380 

aggttttttt gtttgtttat ttgcttcctc tgttttgttt taagcatcat tttcttatgt 10440 

gaggtgggga agggaaaggt atgagggaaa gagagtctga gaattaaaat attttagtat 10500 

aagcaattgg ctgtgatgct caaatccatt gcatcctctt attgaatttg ccaatttgta 10560 

atttttgcat aataaagaac caaaggtgta atgttttgtt gagaggtggt ttagggattt 10620 

tggccctaac caatacattg aatgtatgat gactatttgg gaggacacat ttatgtaccc 10680 

agaggccccc actaataagt ggtactatgg ttacttcctt gtgtacattt ctcttaaaag 10740 

tgatattata tctgtttgta tgagaaaccc agtaaccaat aaaatgaccg catattcctg 10800 

actaaacgta gtaaggaaaa tgcacacttt gtttttactt ttccgtttca ttctaaaggt 10860 

agttaagatg aaatttatat gaaagcattt ttatcacaaa ataaaaaagg tttgccaagc 10920 

tcagtggtgt tgtatttttt attttccaat actgcatcca tggcctggca gtgttacctc 10980 

atgatgtcat aatttgctga gagagcaaat tttcttttct ttctgaatcc cacaaagcct 11040 

agcaccaaac ttcttttttt cttcctttaa ttagatcata aataaatgat cctggggaaa 11100 

aagcatctgt caaataggaa acatcacaaa actgagcact cttctgtgca ctagccatag 11160 

ctggtgacaa acagatggtt gctcagggac aaggtgcctt ccaatggaaa tgcgaagtag 11220 

ttgctatagc aagaattggg aactgggata taagtcataa tattaattat gctgttatgt 11280 

aaatgattgg tttgtaacat tccttaagtg aaatttgtgt agaacttaat atacaggatt 11340 

ataaaataat attttgtgta taaatttgtt ataagttcac attcatacat ttatttataa 11400 

agtcagtgag atatttgaac atgaaaaaaa aaa 11433 

<210> 17
<211> 2471
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Notch-2 

<400> 17 

Met Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala Leu Trp 
1 5 10 15

Leu Cys Cys Ala Ala Pro Ala His Ala Leu Gln Cys Arg Asp Gly Tyr
20 25 30

Glu Pro Cys Val Asn Glu Gly Met Cys Val Thr Tyr His Asn Gly Thr
35 40 45

Gly Tyr Cys Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys Gln His
50 55 60

Arg Asp Pro Cys Glu Lys Asn Arg Cys Gln Asn Gly Gly Thr Cys Val
65 70 75 80

Ala Gln Ala Met Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser Gly Phe
85 90 95

Thr Gly Glu Asp Cys Gln Tyr Ser Thr Ser His Pro Cys Phe Val Ser
100 105 110

Arg Pro Cys Leu Asn Gly Gly Thr Cys His Met Leu Ser Arg Asp Thr
115 120 125

Tyr Glu Cys Thr Cys Gln Val Gly Phe Thr Gly Lys Glu Cys Gln Trp
130 135 140

Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn Gly Ser Thr Cys Thr
145 150 155 160

Thr Val Ala Asn Gln Phe Ser Cys Lys Cys Leu Thr Gly Phe Thr Gly
165 170 175

Gln Lys Cys Glu Thr Asp Val Asn Glu Cys Asp Ile Pro Gly His Cys
180 185 190

Gln His Gly Gly Thr Cys Leu Asn Leu Pro Gly Ser Tyr Gln Cys Gln
195 200 205

Cys Pro Gln Gly Phe Thr Gly Gln Tyr Cys Asp Ser Leu Tyr Val Pro
210 215 220

Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr Cys Arg Gln Thr Gly
225 230 235 240



Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro Gly Phe Glu Gly Ser Thr
245 250 255

Cys Glu Arg Asn Ile Asp Asp Cys Pro Asn His Arg Cys Gln Asn Gly
260 265 270

Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys Pro Pro
275 280 285

Gln Trp Thr Gly Gln Phe Cys Thr Glu Asp Val Asp Glu Cys Leu Leu
290 295 300

Gln Pro Asn Ala Cys Gln Asn Gly Gly Thr Cys Ala Asn Arg Asn Gly
305 310 315 320

Gly Tyr Gly Cys Val Cys Val Asn Gly Trp Ser Gly Asp Asp Cys Ser
325 330 335

Glu Asn Ile Asp Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly Ser Thr
340 345 350

Cys Ile Asp Arg Val Ala Ser Phe Ser Cys Met Cys Pro Glu Gly Lys
355 360 365

Ala Gly Leu Leu Cys His Leu Asp Asp Ala Cys Ile Ser Asn Pro Cys
370 375 380

His Lys Gly Ala Leu Cys Asp Thr Asn Pro Leu Asn Gly Gln Tyr Ile
385 390 395 400

Cys Thr Cys Pro Gln Gly Tyr Lys Gly Ala Asp Cys Thr Glu Asp Val
405 410 415

Asp Glu Cys Ala Met Ala Asn Ser Asn Pro Cys Glu His Ala Gly Lys
420 425 430

Cys Val Asn Thr Asp Gly Ala Phe His Cys Glu Cys Leu Lys Gly Tyr
435 440 445

Ala Gly Pro Arg Cys Glu Met Asp Ile Asn Glu Cys His Ser Asp Pro
450 455 460

Cys Gln Asn Asp Ala Thr Cys Leu Asp Lys Ile Gly Gly Phe Thr Cys
465 470 475 480

Leu Cys Met Pro Gly Phe Lys Gly Val His Cys Glu Leu Glu Ile Asn
485 490 495

Glu Cys Gln Ser Asn Pro Cys Val Asn Asn Gly Gln Cys Val Asp Lys
500 505 510

Val Asn Arg Phe Gln Cys Leu Cys Pro Pro Gly Phe Thr Gly Pro Val
515 520 525 

Cys Gln Ile Asp Ile Asp Asp Cys Ser Ser Thr Pro Cys Leu Asn Gly
530 535 540

Ala Lys Cys Ile Asp His Pro Asn Gly Tyr Glu Cys Gln Cys Ala Thr
545 550 555 560

Gly Phe Thr Gly Val Leu Cys Glu Glu Asn Ile Asp Asn Cys Asp Pro
565 570 575

Asp Pro Cys His His Gly Gln Cys Gln Asp Gly Ile Asp Ser Tyr Thr
580 585 590

Cys Ile Cys Asn Pro Gly Tyr Met Gly Ala Ile Cys Ser Asp Gln Ile
595 600 605

Asp Glu Cys Tyr Ser Ser Pro Cys Leu Asn Asp Gly Arg Cys Ile Asp
610 615 620

Leu Val Asn Gly Tyr Gln Cys Asn Cys Gln Pro Gly Thr Ser Gly Val
625 630 635 640

Asn Cys Glu Ile Asn Phe Asp Asp Cys Ala Ser Asn Pro Cys Ile His
645 650 655

Gly Ile Cys Met Asp Gly Ile Asn Arg Tyr Ser Cys Val Cys Ser Pro
660 665 670

Gly Phe Thr Gly Gln Arg Cys Asn Ile Asp Ile Asp Glu Cys Ala Ser
675 680 685

Asn Pro Cys Arg Lys Gly Ala Thr Cys Ile Asn Gly Val Asn Gly Phe
690 695 700

Arg Cys Ile Cys Pro Glu Gly Pro His His Pro Ser Cys Tyr Ser Gln
705 710 715 720

Val Asn Glu Cys Leu Ser Asn Pro Cys Ile His Gly Asn Cys Thr Gly
725 730 735

Gly Leu Ser Gly Tyr Lys Cys Leu Cys Asp Ala Gly Trp Val Gly Ile
740 745 750

Asn Cys Glu Val Asp Lys Asn Glu Cys Leu Ser Asn Pro Cys Gln Asn
755 760 765

Gly Gly Thr Cys Asp Asn Leu Val Asn Gly Tyr Arg Cys Thr Cys Lys
770 775 780



Lys Gly Phe Lys Gly Tyr Asn Cys Gln Val Asn Ile Asp Glu Cys Ala
785 790 795 800

Ser Asn Pro Cys Leu Asn Gln Gly Thr Cys Phe Asp Asp Ile Ser Gly
805 810 815

Tyr Thr Cys His Cys Val Leu Pro Tyr Thr Gly Lys Asn Cys Gln Thr
820 825 830

Val Leu Ala Pro Cys Ser Pro Asn Pro Cys Glu Asn Ala Ala Val Cys
835 840 845

Lys Glu Ser Pro Asn Phe Glu Ser Tyr Thr Cys Leu Cys Ala Pro Gly
850 855 860

Trp Gln Gly Gln Arg Cys Thr Ile Asp Ile Asp Glu Cys Ile Ser Lys
865 870 875 880

Pro Cys Met Asn His Gly Leu Cys His Asn Thr Gln Gly Ser Tyr Met
885 890 895

Cys Glu Cys Pro Pro Gly Phe Ser Gly Met Asp Cys Glu Glu Asp Ile
900 905 910

Asp Asp Cys Leu Ala Asn Pro Cys Gln Asn Gly Gly Ser Cys Met Asp
915 920 925

Gly Val Asn Thr Phe Ser Cys Leu Cys Leu Pro Gly Phe Thr Gly Asp
930 935 940

Lys Cys Gln Thr Asp Met Asn Glu Cys Leu Ser Glu Pro Cys Lys Asn
945 950 955 960

Gly Gly Thr Cys Ser Asp Tyr Val Asn Ser Tyr Thr Cys Lys Cys Gln
965 970 975

Ala Gly Phe Asp Gly Val His Cys Glu Asn Asn Ile Asn Glu Cys Thr
980 985 990

Glu Ser Ser Cys Phe Asn Gly Gly Thr Cys Val Asp Gly Ile Asn Ser
995 1000 1005

Phe Ser Cys Leu Cys Pro Val Gly Phe Thr Gly Ser Phe Cys Leu
1010 1015 1020

His Glu Ile Asn Glu Cys Ser Ser His Pro Cys Leu Asn Glu Gly
1025 1030 1035

Thr Cys Val Asp Gly Leu Gly Thr Tyr Arg Cys Ser Cys Pro Leu
1040 1045 1050

Arg Ser Pro Cys Lys Asn Lys Gly Thr Cys Val Gln Lys Lys Ala
1070 1075 1080 

Glu Ser Gln Cys Leu Cys Pro Ser Gly Trp Ala Gly Ala Tyr Cys
1085 1090 1095

Asp Val Pro Asn Val Ser Cys Asp Ile Ala Ala Ser Arg Arg Gly
1100 1105 1110

Val Leu Val Glu His Leu Cys Gln His Ser Gly Val Cys Ile Asn
1115 1120 1125

Ala Gly Asn Thr His Tyr Cys Gln Cys Pro Leu Gly Tyr Thr Gly
1130 1135 1140

Ser Tyr Cys Glu Glu Gln Leu Asp Glu Cys Ala Ser Asn Pro Cys
1145 1150 1155

Gln His Gly Ala Thr Cys Ser Asp Phe Ile Gly Gly Tyr Arg Cys
1160 1165 1170

Glu Cys Val Pro Gly Tyr Gln Gly Val Asn Cys Glu Tyr Glu Val
1175 1180 1185

Asp Glu Cys Gln Asn Gln Pro Cys Gln Asn Gly Gly Thr Cys Ile
1190 1195 1200

Asp Leu Val Asn His Phe Lys Cys Ser Cys Pro Pro Gly Thr Arg
1205 1210 1215

Gly Leu Leu Cys Glu Glu Asn Ile Asp Asp Cys Ala Arg Gly Pro
1220 1225 1230

His Cys Leu Asn Gly Gly Gln Cys Met Asp Arg Ile Gly Gly Tyr
1235 1240 1245

Ser Cys Arg Cys Leu Pro Gly Phe Ala Gly Glu Arg Cys Glu Gly
1250 1255 1260

Asp Ile Asn Glu Cys Leu Ser Asn Pro Cys Ser Ser Glu Gly Ser
1265 1270 1275

Leu Asp Cys Ile Gln Leu Thr Asn Asp Tyr Leu Cys Val Cys Arg
1280 1285 1290

Ser Ala Phe Thr Gly Arg His Cys Glu Thr Phe Val Asp Val Cys
1295 1300 1305

Pro Gln Met Pro Cys Leu Asn Gly Gly Thr Cys Ala Val Ala Ser
1310 1315 1320

Asn Met Pro Asp Gly Phe Ile Cys Arg Cys Pro Pro Gly Phe Ser
1325 1330 1335

Gly Ala Arg Cys Gln Ser Ser Cys Gly Gln Val Lys Cys Arg Lys
1340 1345 1350

Gly Glu Gln Cys Val His Thr Ala Ser Gly Pro Arg Cys Phe Cys
1355 1360 1365

Pro Ser Pro Arg Asp Cys Glu Ser Gly Cys Ala Ser Ser Pro Cys
1370 1375 1380

Gln His Gly Gly Ser Cys His Pro Gln Arg Gln Pro Pro Tyr Tyr
1385 1390 1395

Ser Cys Gln Cys Ala Pro Pro Phe Ser Gly Ser Arg Cys Glu Leu
1400 1405 1410

Tyr Thr Ala Pro Pro Ser Thr Pro Pro Ala Thr Cys Leu Ser Gln
1415 1420 1425

Tyr Cys Ala Asp Lys Ala Arg Asp Gly Val Cys Asp Glu Ala Cys
1430 1435 1440

Asn Ser His Ala Cys Gln Trp Asp Gly Gly Asp Cys Ser Leu Thr
1445 1450 1455

Met Glu Asn Pro Trp Ala Asn Cys Ser Ser Pro Leu Pro Cys Trp
1460 1465 1470

Asp Tyr Ile Asn Asn Gln Cys Asp Glu Leu Cys Asn Thr Val Glu
1475 1480 1485

Cys Leu Phe Asp Asn Phe Glu Cys Gln Gly Asn Ser Lys Thr Cys
1490 1495 1500

Lys Tyr Asp Lys Tyr Cys Ala Asp His Phe Lys Asp Asn His Cys
1505 1510 1515

Asp Gln Gly Cys Asn Ser Glu Glu Cys Gly Trp Asp Gly Leu Asp
1520 1525 1530

Cys Ala Ala Asp Gln Pro Glu Asn Leu Ala Glu Gly Thr Leu Val
1535 1540 1545

Ile Val Val Leu Met Pro Pro Glu Gln Leu Leu Gln Asp Ala Arg
1550 1555 1560



Ser Phe Leu Arg Ala Leu Gly Thr Leu Leu His Thr Asn Leu Arg
1565 1570 1575

Ile Lys Arg Asp Ser Gln Gly Glu Leu Met Val Tyr Pro Tyr Tyr
1580 1585 1590

Gly Glu Lys Ser Ala Ala Met Lys Lys Gln Arg Met Thr Arg Arg
1595 1600 1605

Ser Leu Pro Gly Glu Gln Glu Gln Glu Val Ala Gly Ser Lys Val
1610 1615 1620

Phe Leu Glu Ile Asp Asn Arg Gln Cys Val Gln Asp Ser Asp His
1625 1630 1635

Cys Phe Lys Asn Thr Asp Ala Ala Ala Ala Leu Leu Ala Ser His
1640 1645 1650

Ala Ile Gln Gly Thr Leu Ser Tyr Pro Leu Val Ser Val Val Ser
1655 1660 1665

Glu Ser Leu Thr Pro Glu Arg Thr Gln Leu Leu Tyr Leu Leu Ala
1670 1675 1680

Val Ala Val Val Ile Ile Leu Phe Ile Ile Leu Leu Gly Val Ile
1685 1690 1695

Met Ala Lys Arg Lys Arg Lys His Gly Ser Leu Trp Leu Pro Glu
1700 1705 1710

Gly Phe Thr Leu Arg Arg Asp Ala Ser Asn His Lys Arg Arg Glu
1715 1720 1725

Pro Val Gly Gln Asp Ala Val Gly Leu Lys Asn Leu Ser Val Gln
1730 1735 1740

Val Ser Glu Ala Asn Leu Ile Gly Thr Gly Thr Ser Glu His Trp
1745 1750 1755

Val Asp Asp Glu Gly Pro Gln Pro Lys Lys Val Lys Ala Glu Asp
1760 1765 1770

Glu Ala Leu Leu Ser Glu Glu Asp Asp Pro Ile Asp Arg Arg Pro
1775 1780 1785

Trp Thr Gln Gln His Leu Glu Ala Ala Asp Ile Arg Arg Thr Pro
1790 1795 1800

Ser Leu Ala Leu Thr Pro Pro Gln Ala Glu Gln Glu Val Asp Val
1805 1810 1815

Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys Thr Pro Leu Met
1820 1825 1830

Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser Asp Glu Asp
1835 1840 1845

Glu Asp Ala Glu Asp Ser Ser Ala Asn Ile Ile Thr Asp Leu Val
1850 1855 1860

Tyr Gln Gly Ala Ser Leu Gln Ala Gln Thr Asp Arg Thr Gly Glu
1865 1870 1875

Met Ala Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala
1880 1885 1890

Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gln Asp Asn
1895 1900 1905

Met Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gln
1910 1915 1920

Gly Val Phe Gln Ile Leu Ile Arg Asn Arg Val Thr Asp Leu Asp
1925 1930 1935

Ala Arg Met Asn Asp Gly Thr Thr Pro Leu Ile Leu Ala Ala Arg
1940 1945 1950

Leu Ala Val Glu Gly Met Val Ala Glu Leu Ile Asn Cys Gln Ala
1955 1960 1965

Asp Val Asn Ala Val Asp Asp His Gly Lys Ser Ala Leu His Trp
1970 1975 1980

Ala Ala Ala Val Asn Asn Val Glu Ala Thr Leu Leu Leu Leu Lys
1985 1990 1995

Asn Gly Ala Asn Arg Asp Met Gln Asp Asn Lys Glu Glu Thr Pro
2000 2005 2010

Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Ala Ala Lys Ile
2015 2020 2025

Leu Leu Asp His Phe Ala Asn Arg Asp Ile Thr Asp His Met Asp
2030 2035 2040

Arg Leu Pro Arg Asp Val Ala Arg Asp Arg Met His His Asp Ile
2045 2050 2055

Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro Ser Pro Pro Gly
2060 2065 2070

Thr Val Leu Thr Ser Ala Leu Ser Pro Val Ile Cys Gly Pro Asn
2075 2080 2085

Arg Ser Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys Lys Ser
2090 2095 2100

Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro Asn
2105 2110 2115

Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys
2120 2125 2130

Ser Leu Ser Glu Lys Val Gln Leu Ser Glu Ser Ser Val Thr Leu
2135 2140 2145

Ser Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp
2150 2155 2160

Thr Thr Ser Ser Pro Met Ile Thr Ser Pro Gly Ile Leu Gln Ala
2165 2170 2175

Ser Pro Asn Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val
2180 2185 2190

His Ala Gln His Ala Leu Ser Phe Ser Asn Leu His Glu Met Gln
2195 2200 2205

Pro Leu Ala His Gly Ala Ser Thr Val Leu Pro Ser Val Ser Gln
2210 2215 2220

Leu Leu Ser His His His Ile Val Ser Pro Gly Ser Gly Ser Ala
2225 2230 2235

Gly Ser Leu Ser Arg Leu His Pro Val Pro Val Pro Ala Asp Trp
2240 2245 2250

Met Asn Arg Met Glu Val Asn Glu Thr Gln Tyr Asn Glu Met Phe
2255 2260 2265

Gly Met Val Leu Ala Pro Ala Glu Gly Thr His Pro Gly Ile Ala
2270 2275 2280

Pro Gln Ser Arg Pro Pro Glu Gly Lys His Ile Thr Thr Pro Arg
2285 2290 2295

Glu Pro Leu Pro Pro Ile Val Thr Phe Gln Leu Ile Pro Lys Gly
2300 2305 2310

Ser Ile Ala Gln Pro Ala Gly Ala Pro Gln Pro Gln Ser Thr Cys
2315 2320 2325

Pro Pro Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gln Ile Pro
2330 2335 2340 

Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met Met 
2345 2350 2355 

Pro Gln Gln Asp Gly Gln Val Ala Gln Thr Ile Leu Pro Ala Tyr
2360 2365 2370

His Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser
2375 2380 2385

Gln His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser
2390 2395 2400

His Ser Gly His Leu Gln Gly Glu His Pro Tyr Leu Thr Pro Ser
2405 2410 2415

Pro Glu Ser Pro Asp Gln Trp Ser Ser Ser Ser Pro His Ser Ala
2420 2425 2430

Ser Asp Trp Ser Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala
2435 2440 2445

Gly Gly Gly Gln Arg Gly Pro Gly Thr His Met Ser Glu Pro Pro
2450 2455 2460

His Asn Asn Met Gln Val Tyr Ala
2465 2470 

<210> 18 

<211> 8091 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Notch-3 

<400> 18 

acgcggcgcg gaggctggcc cgggacgcgc ccggagccca gggaaggagg gaggagggga 60 

gggtcgcggc cggccgccat ggggccgggg gcccgtggcc gccgccgccg ccgtcgcccg 120 

atgtcgccgc caccgccacc gccacccgtg cgggcgctgc ccctgctgct gctgctagcg 180 

gggccggggg ctgcagcccc cccttgcctg gacggaagcc cgtgtgcaaa tggaggtcgt 240 

tgcacccagc tgccctcccg ggaggctgcc tgcctgtgcc cgcctggctg ggtgggtgag 300 

cggtgtcagc tggaggaccc ctgtcactca ggcccctgtg ctggccgtgg tgtctgccag 360 

agttcagtgg tggctggcac cgcccgattc tcatgccggt gcccccgtgg cttccgaggc 420

cctgactgct ccctgccaga tccctgcctc agcagccctt gtgcccacgg tgcccgctgc 480

tcagtggggc ccgatggacg cttcctctgc tcctgcccac ctggctacca gggccgcagc 540

tgccgaagcg acgtggatga gtgccgggtg ggtgagccct gccgccatgg tggcacctgc 600

ctcaacacac ctggctcctt ccgctgccag tgtccagctg gctacacagg gccactatgt 660

gagaaccccg cggtgccctg tgcgccctca ccatgccgta acgggggcac ctgcaggcag 720

agtggcgacc tcacttacga ctgtgcctgt cttcctgggt ttgagggtca gaattgtgaa 780

gtgaacgtgg acgactgtcc aggacaccga tgtctcaatg gggggacatg cgtggatggc 840

gtcaacacct ataactgcca gtgccctcct gagtggacag gccagttctg cacggaggac 900

gtggatgagt gtcagctgca gcccaacgcc tgccacaatg ggggtacctg cttcaacacg 960

ctgggtggcc acagctgcgt gtgtgtcaat ggctggacag gtgagagctg cagtcagaat 1020

atcgatgact gtgccacagc cgtgtgcttc catggggcca cctgccatga ccgcgtggct 1080

tctttctact gtgcctgccc catgggcaag actggcctcc tgtgtcacct ggatgacgcc 1140

tgtgtcagca acccctgcca cgaggatgct atctgtgaca caaatccggt gaacggccgg 1200

gccatttgca cctgtcctcc cggcttcacg ggtggggcat gtgaccagga tgtggacgag 1260

tgctctatcg gcgccaaccc ctgcgagcac ttgggcaggt gcgtgaacac gcagggctcc 1320

ttcctgtgcc agtgcggtcg tggctacact ggacctcgct gtgagaccga tgtcaacgag 1380

tgtctgtcgg ggccctgccg aaaccaggcc acgtgcctcg accgcatagg ccagttcacc 1440

tgtatctgta tggcaggctt cacaggaacc tattgcgagg tggacattga cgagtgtcag 1500

agtagcccct gtgtcaacgg tggggtctgc aaggaccgag tcaatggctt cagctgcacc 1560

tgcccctcgg gcttcagcgg ctccacgtgt cagctggacg tggacgaatg cgccagcacg 1620

ccctgcagga atggcgccaa atgcgtggac cagcccgatg gctacgagtg ccgctgtgcc 1680

gagggctttg agggcacgct gtgtgatcgc aacgtggacg actgctcccc tgacccatgc 1740

caccatggtc gctgcgtgga tggcatcgcc agcttctcat gtgcctgtgc tcctggctac 1800

acgggcacac gctgcgagag ccaggtggac gaatgccgca gccagccctg ccgccatggc 1860

ggcaaatgcc tagacctggt ggacaagtac ctctgccgct gcccttctgg gaccacaggt 1920

gtgaactgcg aagtgaacat tgacgactgt gccagcaacc cctgcacctt tggagtctgc 1980

cgtgatggca tcaaccgcta cgactgtgtc tgccaacctg gcttcacagg gcccctttgt 2040

aacgtggaga tcaatgagtg tgcttccagc ccatgcggcg agggaggttc ctgtgtggat 2100

ggggaaaatg gcttccgctg cctctgcccg cctggctcct tgcccccact ctgcctcccc 2160

ccgagccatc cctgtgccca tgagccctgc agtcacggca tctgctatga tgcacctggc 2220

gggttccgct gtgtgtgtga gcctggctgg agtggccccc gctgcagcca gagcctggcc 2280

cgagacgcct gtgagtccca gccgtgcagg gccggtggga catgcagcag cgatggaatg 2340

ggtttccact gcacctgccc gcctggtgtc cagggacgtc agtgtgaact cctctccccc 2400

tgcaccccga acccctgtga gcatgggggc cgctgcgagt ctgcccctgg ccagctgcct 2460

ctggcaaggc ccacgatgcc agcaggatgt ggacgagtgt 2520

gctggccccg caccctgtgg ccctcatggt atctgcacca acctggcagg gagtttcagc 2580

tgcacctgcc atggagggta cactggccct tcctgtgatc aggacatcaa tgactgtgac 2640

cccaacccat gcctgaacgg tggctcgtgc caagacggcg tgggctcctt ttcctgctcc 2700

tgcctccctg gtttcgccgg cccacgatgc gcccgcgatg tggatgagtg cctgagcaac 2760

ccctgcggcc cgggcacctg taccgaccac gtggcctcct tcacctgcac ctgcccgccg 2820

ggctacggag gcttccactg cgaacaggac ctgcccgact gcagccccag ctcctgcttc 2880

aatggcggga cctgtgtgga cggcgtgaac tcgttcagct gcctgtgccg tcccggctac 2940

acaggagccc actgccaaca tgaggcagac ccctgcctct cgcggccctg cctacacggg 3000

ggcgtctgca gcgccgccca ccctggcttc cgctgcacct gcctcgagag cttcacgggc 3060

ccgcagtgcc agacgctggt ggattggtgc agccgccagc cttgtcaaaa cgggggtcgc 3120

tgcgtccaga ctggggccta ttgcctttgt ccccctggat ggagcggacg cctctgtgac 3180

atccgaagct tgccctgcag ggaggccgca gcccagatcg gggtgcggct ggagcagctg 3240

tgtcaggcgg gtgggcagtg tgtggatgaa gacagctccc actactgcgt gtgcccagag 3300

ggccgtactg gtagccactg tgagcaggag gtggacccct gcttggccca gccctgccag 3360

catgggggga cctgccgtgg ctatatgggg ggctacatgt gtgagtgtct tcctggctac 3420

aatggtgata actgtgagga cgacgtggac gagtgtgcct cccagccctg ccagcacggg 3480

ggttcatgca ttgacctcgt ggcccgctat ctctgctcct gtcccccagg aacgctgggg 3540

gtgctctgcg agattaatga ggatgactgc ggcccaggcc caccgctgga ctcagggccc 3600

cggtgcctac acaatggcac ctgcgtggac ctggtgggtg gtttccgctg cacctgtccc 3660

ccaggataca ctggtttgcg ctgcgaggca gacatcaatg agtgtcgctc aggtgcctgc 3720

cacgcggcac acacccggga ctgcctgcag gacccaggcg gaggtttccg ttgcctttgt 3780

catgctggct tctcaggtcc tcgctgtcag actgtcctgt ctccctgcga gtcccagcca 3840

tgccagcatg gaggccagtg ccgtcctagc ccgggtcctg ggggtgggct gaccttcacc 3900

tgtcactgtg cccagccgtt ctggggtccg cgttgcgagc gggtggcgcg ctcctgccgg 3960

gagctgcagt gcccggtggg cgtcccatgc cagcagacgc cccgcgggcc gcgctgcgcc 4020

tgccccccag ggttgtcggg accctcctgc cgcagcttcc cggggtcgcc gccgggggcc 4080

agcaacgcca gctgcgcggc cgccccctgt ctccacgggg gctcctgccg ccccgcgccg 4140

ctcgcgccct tcttccgctg cgcttgcgcg cagggctgga ccgggccgcg ctgcgaggcg 4200

cccgccgcgg cacccgaggt ctcggaggag ccgcggtgcc cgcgcgccgc ctgccaggcc 4260

aagcgcgggg accagcgctg cgaccgcgag tgcaacagcc caggctgcgg ctgggacggc 4320

ggcgactgct cgctgagcgt gggcgacccc tggcggcaat gcgaggcgct gcagtgctgg 4380

cgcctcttca acaacagccg ctgcgacccc gcctgcagct cgcccgcctg cctctacgac 4440

aacttcgact gccacgccgg tggccgcgag cgcacttgca acccggtgta cgagaagtac 4500

tgcgccgacc actttgccga cggccgctgc gaccagggct gcaacacgga ggagtgcggc 4560

tgggatgggc tggattgtgc cagcgaggtg ccggccctgc tggcccgcgg cgtgctggtg 4620 

ctcacagtgc tgctgccgcc ggaggagcta ctgcgttcca gcgccgactt tctgcagcgg 4680 

ctcagcgcca tcctgcgcac ctcgctgcgc ttccgcctgg acgcgcacgg ccaggccatg 4740 

gtcttccctt accaccggcc tagtcctggc tccgaacccc gggcccgtcg ggagctggcc 4800 

cccgaggtga tcggctcggt agtaatgctg gagattgaca accggctctg cctgcagtcg 4860 

cctgagaatg atcactgctt ccccgatgcc cagagcgccg ctgactacct gggagcgttg 4920 

tcagcggtgg agcgcctgga cttcccgtac ccactgcggg acgtgcgggg ggagccgctg 4980 

gagcctccag aacccagcgt cccgctgctg ccactgctag tggcgggcgc tgtcttgctg 5040 

ctggtcattc tcgtcctggg tgtcatggtg gcccggcgca agcgcgagca cagcaccctc 5100 

tggttccctg agggcttctc actgcacaag gacgtggcct ctggtcacaa gggccggcgg 5160 

gaacccgtgg gccaggacgc gctgggcatg aagaacatgg ccaagggtga gagcctgatg 5220 

ggggaggtgg ccacagactg gatggacaca gagtgcccag aggccaagcg gctaaaggta 5280 

gaggagccag gcatgggggc tgaggaggct gtggattgcc gtcagtggac tcaacaccat 5340 

ctggttgctg ctgacatccg cgtggcacca gccatggcac tgacaccacc acagggcgac 5400 

gcagatgctg atggcatgga tgtcaatgtg cgtggcccag atggcttcac cccgctaatg 5460 

ctggcttcct tctgtggggg ggctctggag ccaatgccaa ctgaagagga tgaggcagat 5520 

gacacatcag ctagcatcat ctccgacctg atctgccagg gggctcagct tggggcacgg 5580 

actgaccgta ctggcgagac tgctttgcac ctggctgccc gttatgcccg tgctgatgca 5640 

gccaagcggc tgctggatgc tggggcagac accaatgccc aggaccactc aggccgcact 5700 

cccctgcaca cagctgtcac agccgatgcc cagggtgtct tccagattct catccgaaac 5760 

cgctctacag acttggatgc ccgcatggca gatggctcaa cggcactgat cctggcggcc 5820 

cgcctggcag tagagggcat ggtggaagag ctcatcgcca gccatgctga tgtcaatgct 5880 

gtggatgagc ttgggaaatc agccttacac tgggctgcgg ctgtgaacaa cgtggaagcc 5940 

actttggccc tgctcaaaaa tggagccaat aaggacatgc aggatagcaa ggaggagacc 6000 

cccctattcc tggccgcccg cgagggcagc tatgaggctg ccaagctgct gttggaccac 6060 

tttgccaacc gtgagatcac cgaccacctg gacaggctgc cgcgggacgt agcccaggag 6120 

agactgcacc aggacatcgt gcgcttgctg gatcaaccca gtgggccccg cagccccccc 6180 

ggtccccacg gcctggggcc tctgctctgt cctccagggg ccttcctccc tggcctcaaa 6240 

gcggcacagt cggggtccaa gaagagcagg aggccccccg ggaaggcggg gctggggccg 6300 

caggggcccc gggggcgggg caagaagctg acgctggcct gcccgggccc cctggctgac 6360 

agctcggtca cgctgtcgcc cgtggactcg ctggactccc cgcggccttt cggtgggccc 6420 

cctgcttccc ctggtggctt cccccttgag gggccctatg cagctgccac tgccactgca 6480 

gtgtctctgg cacagcttgg tggcccaggc cgggcaggtc tagggcgcca gccccctgga 6540

ggatgtgtac tcagcctggg cctgctgaac cctgtggctg tgcccctcga ttgggcccgg 6600

ctgcccccac ctgcccctcc aggcccctcg ttcctgctgc cactggcgcc gggaccccag 6660 

ctgctcaacc cagggacccc cgtctccccg caggagcggc ccccgcctta cctggcagtc 6720 

ccaggacatg gcgaggagta cccggtggct ggggcacaca gcagcccccc aaaggcccgc 6780 

ttcctgcggg ttcccagtga gcacccttac ctgaccccat cccccgaatc ccctgagcac 6840 

tgggccagcc cctcacctcc ctccctctca gactggtccg aatccacgcc tagcccagcc 6900 

actgccactg gggccatggc caccaccact ggggcactgc ctgcccagcc acttcccttg 6960 

tctgttccca gctcccttgc tcaggcccag acccagctgg ggccccagcc ggaagttacc 7020 

cccaagaggc aagtgttggc ctgagacgct cgtcagttct tagatcttgg gggcctaaag 7080 

agacccccgt cctgcctcct ttctttctct gtctcttcct tccttttagt ctttttcatc 7140 

ctcttctctt tccaccaacc ctcctgcatc cttgccttgc agcgtgaccg agataggtca 7200 

tcagcccagg gcttcagtct tcctttattt ataatgggtg ggggctacca cccaccctct 7260 

cagtcttgtg aagagtctgg gacctccttc ttccccactt ctctcttccc tcattccttt 7320 

ctctctcctt ctggcctctc atttccttac actctgacat gaatgaatta ttattatttt 7380 

tctttttctt ttttttttta cattttgtat agaaacaaat tcatttaaac aaacttatta 7440 

ttattatttt ttacaaaata tatatatgga gatgctccct ccccctgtga accccccagt 7500 

gcccccgtgg ggctgagtct gtgggcccat tcggccaagc tggattctgt gtacctagta 7560 

cacaggcatg actgggatcc cgtgtaccga gtacacgacc caggtatgta ccaagtaggc 7620 

acccttgggc gcacccactg gggccagggg tcgggggagt gttgggagcc tcctccccac 7680 
cccacctccc tcacttcact gcattccaga ttggacatgt tccatagcct tgctggggaa 7740

gggcccactg ccaactccct ctgccccagc cccacccttg gccatctccc tttgggaact 7800

agggggctgc tggtgggaaa tgggagccag ggcagatgta tgcattcctt tatgtccctg 7860

taaatgtggg actacaagaa gaggagctgc ctgagtggta ctttctcttc ctggtaatcc 7920

tctggcccag ccttatggca gaatagaggt atttttaggc tatttttgta atatggcttc 7980

tggtcaaaat ccctgtgtag ctgaattccc aagccctgca ttgtacagcc ccccactccc 8040

ctcaccacct aataaaggaa tagttaacac tcaaaaaaaa aaaaaaaaaa a 8091

<210> 19

<211> 2321

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature
<223> Notch-3

<400> 19

Met Gly Pro Gly Ala Arg Gly Arg Arg Arg Arg Arg Arg Pro Met Ser
1 5 10 15

Pro Pro Pro Pro Pro Pro Pro Val Arg Ala Leu Pro Leu Leu Leu Leu
20 25 30 

Leu Ala Gly Pro Gly Ala Ala Ala Pro Pro Cys Leu Asp Gly Ser Pro
35 40 45

Cys Ala Asn Gly Gly Arg Cys Thr Gln Leu Pro Ser Arg Glu Ala Ala
50 55 60

Cys Leu Cys Pro Pro Gly Trp Val Gly Glu Arg Cys Gln Leu Glu Asp
65 70 75 80

Pro Cys His Ser Gly Pro Cys Ala Gly Arg Gly Val Cys Gln Ser Ser
85 90 95

Val Val Ala Gly Thr Ala Arg Phe Ser Cys Arg Cys Pro Arg Gly Phe
100 105 110

Arg Gly Pro Asp Cys Ser Leu Pro Asp Pro Cys Leu Ser Ser Pro Cys
115 120 125

Ala His Gly Ala Arg Cys Ser Val Gly Pro Asp Gly Arg Phe Leu Cys
130 135 140

Ser Cys Pro Pro Gly Tyr Gln Gly Arg Ser Cys Arg Ser Asp Val Asp
145 150 155 160

Glu Cys Arg Val Gly Glu Pro Cys Arg His Gly Gly Thr Cys Leu Asn
165 170 175

Thr Pro Gly Ser Phe Arg Cys Gln Cys Pro Ala Gly Tyr Thr Gly Pro
180 185 190

Leu Cys Glu Asn Pro Ala Val Pro Cys Ala Pro Ser Pro Cys Arg Asn
195 200 205

Gly Gly Thr Cys Arg Gln Ser Gly Asp Leu Thr Tyr Asp Cys Ala Cys
210 215 220

Leu Pro Gly Phe Glu Gly Gln Asn Cys Glu Val Asn Val Asp Asp Cys
225 230 235 240

Pro Gly His Arg Cys Leu Asn Gly Gly Thr Cys Val Asp Gly Val Asn
245 250 255

Thr Tyr Asn Cys Gln Cys Pro Pro Glu Trp Thr Gly Gln Phe Cys Thr
260 265 270

Glu Asp Val Asp Glu Cys Gln Leu Gln Pro Asn Ala Cys His Asn Gly
275 280 285

Gly Thr Cys Phe Asn Thr Leu Gly Gly His Ser Cys Val Cys Val Asn
290 295 300 

Gly Trp Thr Gly Glu Ser Cys Ser Gln Asn Ile Asp Asp Cys Ala Thr
305 310 315 320

Ala Val Cys Phe His Gly Ala Thr Cys His Asp Arg Val Ala Ser Phe
325 330 335

Tyr Cys Ala Cys Pro Met Gly Lys Thr Gly Leu Leu Cys His Leu Asp
340 345 350

Asp Ala Cys Val Ser Asn Pro Cys His Glu Asp Ala Ile Cys Asp Thr
355 360 365

Asn Pro Val Asn Gly Arg Ala Ile Cys Thr Cys Pro Pro Gly Phe Thr
370 375 380

Gly Gly Ala Cys Asp Gln Asp Val Asp Glu Cys Ser Ile Gly Ala Asn
385 390 395 400

Pro Cys Glu His Leu Gly Arg Cys Val Asn Thr Gln Gly Ser Phe Leu
405 410 415

Cys Gln Cys Gly Arg Gly Tyr Thr Gly Pro Arg Cys Glu Thr Asp Val
420 425 430

Asn Glu Cys Leu Ser Gly Pro Cys Arg Asn Gln Ala Thr Cys Leu Asp
435 440 445

Arg Ile Gly Gln Phe Thr Cys Ile Cys Met Ala Gly Phe Thr Gly Thr
450 455 460

Tyr Cys Glu Val Asp Ile Asp Glu Cys Gln Ser Ser Pro Cys Val Asn
465 470 475 480

Gly Gly Val Cys Lys Asp Arg Val Asn Gly Phe Ser Cys Thr Cys Pro
485 490 495

Ser Gly Phe Ser Gly Ser Thr Cys Gln Leu Asp Val Asp Glu Cys Ala
500 505 510

Ser Thr Pro Cys Arg Asn Gly Ala Lys Cys Val Asp Gln Pro Asp Gly
515 520 525

Tyr Glu Cys Arg Cys Ala Glu Gly Phe Glu Gly Thr Leu Cys Asp Arg
530 535 540

Asn Val Asp Asp Cys Ser Pro Asp Pro Cys His His Gly Arg Cys Val
545 550 555 560

Asp Gly Ile Ala Ser Phe Ser Cys Ala Cys Ala Pro Gly Tyr Thr Gly
565 570 575 

Thr Arg Cys Glu Ser Gln Val Asp Glu Cys Arg Ser Gln Pro Cys Arg
580 585 590

His Gly Gly Lys Cys Leu Asp Leu Val Asp Lys Tyr Leu Cys Arg Cys
595 600 605

Pro Ser Gly Thr Thr Gly Val Asn Cys Glu Val Asn Ile Asp Asp Cys
610 615 620

Ala Ser Asn Pro Cys Thr Phe Gly Val Cys Arg Asp Gly Ile Asn Arg
625 630 635 640

Tyr Asp Cys Val Cys Gln Pro Gly Phe Thr Gly Pro Leu Cys Asn Val
645 650 655

Glu Ile Asn Glu Cys Ala Ser Ser Pro Cys Gly Glu Gly Gly Ser Cys
660 665 670

Val Asp Gly Glu Asn Gly Phe Arg Cys Leu Cys Pro Pro Gly Ser Leu
675 680 685

Pro Pro Leu Cys Leu Pro Pro Ser His Pro Cys Ala His Glu Pro Cys
690 695 700

Ser His Gly Ile Cys Tyr Asp Ala Pro Gly Gly Phe Arg Cys Val Cys
705 710 715 720

Glu Pro Gly Trp Ser Gly Pro Arg Cys Ser Gln Ser Leu Ala Arg Asp
725 730 735

Ala Cys Glu Ser Gln Pro Cys Arg Ala Gly Gly Thr Cys Ser Ser Asp
740 745 750

Gly Met Gly Phe His Cys Thr Cys Pro Pro Gly Val Gln Gly Arg Gln
755 760 765

Cys Glu Leu Leu Ser Pro Cys Thr Pro Asn Pro Cys Glu His Gly Gly
770 775 780

Arg Cys Glu Ser Ala Pro Gly Gln Leu Pro Val Cys Ser Cys Pro Gln
785 790 795 800

Gly Trp Gln Gly Pro Arg Cys Gln Gln Asp Val Asp Glu Cys Ala Gly
805 810 815

Pro Ala Pro Cys Gly Pro His Gly Ile Cys Thr Asn Leu Ala Gly Ser
820 825 830

Phe Ser Cys Thr Cys His Gly Gly Tyr Thr Gly Pro Ser Cys Asp Gln
835 840 845 

Asp Ile Asn Asp Cys Asp Pro Asn Pro Cys Leu Asn Gly Gly Ser Cys
850 855 860

Gln Asp Gly Val Gly Ser Phe Ser Cys Ser Cys Leu Pro Gly Phe Ala
865 870 875 880

Gly Pro Arg Cys Ala Arg Asp Val Asp Glu Cys Leu Ser Asn Pro Cys
885 890 895

Gly Pro Gly Thr Cys Thr Asp His Val Ala Ser Phe Thr Cys Thr Cys
900 905 910

Pro Pro Gly Tyr Gly Gly Phe His Cys Glu Gln Asp Leu Pro Asp Cys
915 920 925

Ser Pro Ser Ser Cys Phe Asn Gly Gly Thr Cys Val Asp Gly Val Asn
930 935 940

Ser Phe Ser Cys Leu Cys Arg Pro Gly Tyr Thr Gly Ala His Cys Gln
945 950 955 960

His Glu Ala Asp Pro Cys Leu Ser Arg Pro Cys Leu His Gly Gly Val
965 970 975

Cys Ser Ala Ala His Pro Gly Phe Arg Cys Thr Cys Leu Glu Ser Phe
980 985 990

Thr Gly Pro Gln Cys Gln Thr Leu Val Asp Trp Cys Ser Arg Gln Pro
995 1000 1005

Cys Gln Asn Gly Gly Arg Cys Val Gln Thr Gly Ala Tyr Cys Leu
1010 1015 1020

Cys Pro Pro Gly Trp Ser Gly Arg Leu Cys Asp Ile Arg Ser Leu
1025 1030 1035

Pro Cys Arg Glu Ala Ala Ala Gln Ile Gly Val Arg Leu Glu Gln
1040 1045 1050

Leu Cys Gln Ala Gly Gly Gln Cys Val Asp Glu Asp Ser Ser His
1055 1060 1065

Tyr Cys Val Cys Pro Glu Gly Arg Thr Gly Ser His Cys Glu Gln
1070 1075 1080

Glu Val Asp Pro Cys Leu Ala Gln Pro Cys Gln His Gly Gly Thr
1085 1090 1095

Cys Arg Gly Tyr Met Gly Gly Tyr Met Cys Glu Cys Leu Pro Gly
1100 1105 1110 

Tyr Asn Gly Asp Asn cys Glu Asp Asp Val Asp Glu cys Ala ser 
1115 1120 1125 

Gin Pro Cys Gin His Gly Gly Ser Cys lie Asp Leu Val Ala Arg 
1130 1135 1140 

Tyr Leu Cys Ser Cys Pro Pro Gly Thr Leu Gly val Leu cys Glu 
1145 1150 1155 

lie Ash Glu Asp Asp Cys Gly Pro Gly Pro Pro Leu Asp Ser Gly 
1160 1165 1170 

Pro Arg Cys Leu His Asn Gly Thr Cys Val Asp Leu val Gly Gly 
1175 1180 1185 

Phe Arg cys Thr Cys Pro Pro Gly Tyr Thr Gly Leu Arg Cys Glu 
1190 1195 1200 

Ala Asp lie Asn Glu Cys Arg Ser Gly Ala Cys His Ala Ala His 
1205 1210 1215 

Thr Arg Asp cys Leu Gin Asp Pro Gly Gly Gly Phe Arg cys Leu 
1220 1225 1230 

cys His Ala Gly Phe ser Gly pro Arg cys Gin Thr val Leu ser 
1235 1240 1245 

pro cys Glu ser Gin Pro Cys Gin His Gly Gly Gin cys Arg pro 
1250 1255 1260 

ser Pro Gly Pro Gly Gly Gly Leu Thr phe Thr Cys His Cys Ala 
1265 1270 1275 

Gin Pro Phe Trp Gly Pro Arg Cys Glu Arg val Ala Arg Ser cys 
1280 1285 1290 

Arq Glu Leu Gin Cys Pro Val Gly val Pro cys Gin Gin Thr pro 
1295 1300 1305 

Arq Gly Pro Arg Cys Ala Cys Pro Pro Gly Leu ser Gly Pro Ser 
1310 1315 1320 

Cvs Arq ser Phe Pro Gly ser Pro Pro Gly Ala Ser Asn Ala ser 
y 1325 1330 1335 

cys Ala Ala Ala pro cys Leu His Gly Gly Ser Cys Arg Pro Ala 
1340 1345 1350 




Pro Leu Ala Pro Phe Phe Arg Cys Ala Cys Ala Gln Gly Trp Thr
1355 1360 1365

Gly Pro Arg Cys Glu Ala Pro Ala Ala Ala Pro Glu Val Ser Glu
1370 1375 1380

Glu Pro Arg Cys Pro Arg Ala Ala Cys Gln Ala Lys Arg Gly Asp
1385 1390 1395

Gln Arg Cys Asp Arg Glu Cys Asn Ser Pro Gly Cys Gly Trp Asp
1400 1405 1410

Gly Gly Asp Cys Ser Leu Ser Val Gly Asp Pro Trp Arg Gln Cys
1415 1420 1425

Glu Ala Leu Gln Cys Trp Arg Leu Phe Asn Asn Ser Arg Cys Asp
1430 1435 1440

Pro Ala Cys Ser Ser Pro Ala Cys Leu Tyr Asp Asn Phe Asp Cys
1445 1450 1455

His Ala Gly Gly Arg Glu Arg Thr Cys Asn Pro Val Tyr Glu Lys
1460 1465 1470

Tyr Cys Ala Asp His Phe Ala Asp Gly Arg Cys Asp Gln Gly Cys
1475 1480 1485

Asn Thr Glu Glu Cys Gly Trp Asp Gly Leu Asp Cys Ala Ser Glu
1490 1495 1500

Val Pro Ala Leu Leu Ala Arg Gly Val Leu Val Leu Thr Val Leu
1505 1510 1515

Leu Pro Pro Glu Glu Leu Leu Arg Ser Ser Ala Asp Phe Leu Gln
1520 1525 1530

Arg Leu Ser Ala Ile Leu Arg Thr Ser Leu Arg Phe Arg Leu Asp
1535 1540 1545

Ala His Gly Gln Ala Met Val Phe Pro Tyr His Arg Pro Ser Pro
1550 1555 1560

Gly Ser Glu Pro Arg Ala Arg Arg Glu Leu Ala Pro Glu Val Ile
1565 1570 1575

Gly Ser Val Val Met Leu Glu Ile Asp Asn Arg Leu Cys Leu Gln
1580 1585 1590

Ser Pro Glu Asn Asp His Cys Phe Pro Asp Ala Gln Ser Ala Ala
1595 1600 1605

Asp Tyr Leu Gly Ala Leu Ser Ala Val Glu Arg Leu Asp Phe Pro
1610 1615 1620

Tyr Pro Leu Arg Asp Val Arg Gly Glu Pro Leu Glu Pro Pro Glu
1625 1630 1635

Pro Ser Val Pro Leu Leu Pro Leu Leu Val Ala Gly Ala Val Leu
1640 1645 1650

Leu Leu Val Ile Leu Val Leu Gly Val Met Val Ala Arg Arg Lys
1655 1660 1665

Arg Glu His Ser Thr Leu Trp Phe Pro Glu Gly Phe Ser Leu His
1670 1675 1680

Lys Asp Val Ala Ser Gly His Lys Gly Arg Arg Glu Pro Val Gly
1685 1690 1695

Gln Asp Ala Leu Gly Met Lys Asn Met Ala Lys Gly Glu Ser Leu
1700 1705 1710

Met Gly Glu Val Ala Thr Asp Trp Met Asp Thr Glu Cys Pro Glu
1715 1720 1725

Ala Lys Arg Leu Lys Val Glu Glu Pro Gly Met Gly Ala Glu Glu
1730 1735 1740

Ala Val Asp Cys Arg Gln Trp Thr Gln His His Leu Val Ala Ala
1745 1750 1755

Asp Ile Arg Val Ala Pro Ala Met Ala Leu Thr Pro Pro Gln Gly
1760 1765 1770

Asp Ala Asp Ala Asp Gly Met Asp Val Asn Val Arg Gly Pro Asp
1775 1780 1785

Gly Phe Thr Pro Leu Met Leu Ala Ser Phe Cys Gly Gly Ala Leu
1790 1795 1800

Glu Pro Met Pro Thr Glu Glu Asp Glu Ala Asp Asp Thr Ser Ala
1805 1810 1815

Ser Ile Ile Ser Asp Leu Ile Cys Gln Gly Ala Gln Leu Gly Ala
1820 1825 1830

Arg Thr Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg
1835 1840 1845

Tyr Ala Arg Ala Asp Ala Ala Lys Arg Leu Leu Asp Ala Gly Ala
1850 1855 1860

Asp Thr Asn Ala Gln Asp His Ser Gly Arg Thr Pro Leu His Thr
1865 1870 1875

Ala Val Thr Ala Asp Ala Gln Gly Val Phe Gln Ile Leu Ile Arg
1880 1885 1890

Asn Arg Ser Thr Asp Leu Asp Ala Arg Met Ala Asp Gly Ser Thr
1895 1900 1905

Ala Leu Ile Leu Ala Ala Arg Leu Ala Val Glu Gly Met Val Glu
1910 1915 1920

Glu Leu Ile Ala Ser His Ala Asp Val Asn Ala Val Asp Glu Leu
1925 1930 1935

Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val Glu
1940 1945 1950

Ala Thr Leu Ala Leu Leu Lys Asn Gly Ala Asn Lys Asp Met Gln
1955 1960 1965

Asp Ser Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly
1970 1975 1980

Ser Tyr Glu Ala Ala Lys Leu Leu Leu Asp His Phe Ala Asn Arg
1985 1990 1995

Glu Ile Thr Asp His Leu Asp Arg Leu Pro Arg Asp Val Ala Gln
2000 2005 2010

Glu Arg Leu His Gln Asp Ile Val Arg Leu Leu Asp Gln Pro Ser
2015 2020 2025

Gly Pro Arg Ser Pro Pro Gly Pro His Gly Leu Gly Pro Leu Leu
2030 2035 2040

Cys Pro Pro Gly Ala Phe Leu Pro Gly Leu Lys Ala Ala Gln Ser
2045 2050 2055

Gly Ser Lys Lys Ser Arg Arg Pro Pro Gly Lys Ala Gly Leu Gly
2060 2065 2070

Pro Gln Gly Pro Arg Gly Arg Gly Lys Lys Leu Thr Leu Ala Cys
2075 2080 2085

Pro Gly Pro Leu Ala Asp Ser Ser Val Thr Leu Ser Pro Val Asp
2090 2095 2100

Ser Leu Asp Ser Pro Arg Pro Phe Gly Gly Pro Pro Ala Ser Pro
2105 2110 2115

Gly Gly Phe Pro Leu Glu Gly Pro Tyr Ala Ala Ala Thr Ala Thr
2120 2125 2130

Ala Val Ser Leu Ala Gln Leu Gly Gly Pro Gly Arg Ala Gly Leu
2135 2140 2145

Gly Arg Gln Pro Pro Gly Gly Cys Val Leu Ser Leu Gly Leu Leu
2150 2155 2160

Asn Pro Val Ala Val Pro Leu Asp Trp Ala Arg Leu Pro Pro Pro
2165 2170 2175

Ala Pro Pro Gly Pro Ser Phe Leu Leu Pro Leu Ala Pro Gly Pro
2180 2185 2190

Gln Leu Leu Asn Pro Gly Thr Pro Val Ser Pro Gln Glu Arg Pro
2195 2200 2205

Pro Pro Tyr Leu Ala Val Pro Gly His Gly Glu Glu Tyr Pro Val
2210 2215 2220

Ala Gly Ala His Ser Ser Pro Pro Lys Ala Arg Phe Leu Arg Val
2225 2230 2235

Pro Ser Glu His Pro Tyr Leu Thr Pro Ser Pro Glu Ser Pro Glu
2240 2245 2250

His Trp Ala Ser Pro Ser Pro Pro Ser Leu Ser Asp Trp Ser Glu
2255 2260 2265

Ser Thr Pro Ser Pro Ala Thr Ala Thr Gly Ala Met Ala Thr Thr
2270 2275 2280

Thr Gly Ala Leu Pro Ala Gln Pro Leu Pro Leu Ser Val Pro Ser
2285 2290 2295

Ser Leu Ala Gln Ala Gln Thr Gln Leu Gly Pro Gln Pro Glu Val
2300 2305 2310

Thr Pro Lys Arg Gln Val Leu Ala
2315 2320

<210> 20
agacgtgagg cttgcagcag gccgaggagg aagaagaggg gcagtgggag cagaggaggt 60

ggctcctgcc ccagtgagag ctctgagggt ccctgcctga agagggacag ggactggggc 120

ttggagaagg ggctgtggaa tgcagccccc ttcactgctg ctgctgctgc tgctgctgct 180

gctatgtgtc tcagtggtca gacccagagg gctgctgtgt gggagtttcc cagaaccctg 240

tgccaatgga ggcacctgcc tgagcctgtc tctgggacaa gggacctgcc agtgtgcccc 300

tggcttcctg ggtgagacgt gccagtttcc tgacccctgc cagaacgccc agctctgcca 360

aaatggaggc agctgccaag ccctgcttcc cgctccccta gggctcccca gctctccctc 420

tccattgaca cccagcttct tgtgcacttg cctccctggc ttcactggtg agagatgcca 480

ggccaagctt gaagaccctt gtcctccctc cttctgttcc aaaaggggcc gctgccacat 540

ccaggcctcg ggccgcccac agtgctcctg catgcctgga tggacaggtg agcagtgcca 600

gcttcgggac ttctgttcag ccaacccatg tgttaatgga ggggtgtgtc tggccacgta 660

cccccagatc cagtgccact gcccaccggg cttcgagggc catgcctgtg aacgtgatgt 720

caacgagtgc ttccaggacc caggaccctg ccccaaaggc acctcctgcc ataacaccct 780

gggctccttc cagtgcctct gccctgtggg gcaggagggt ccacgttgtg agctgcgggc 840

aggaccctgc cctcctaggg gctgttcgaa tgggggcacc tgccagctga tgccagagaa 900

agactccacc tttcacctct gcctctgtcc cccaggtttc ataggcccgg gctgtgaggt 960

gaatccagac aactgtgtca gccaccaatg tcagaatggg ggcacttgcc aggatgggct 1020

ggacacctac acctgcctct gcccagaaac ctggacaggc tgggactgct ccgaagatgt 1080

ggatgagtgt gaggcccagg gtccccctca ctgcagaaac gggggcacct gccagaactc 1140

tgctggtagc tttcactgcg tgtgtgtgag tggctggggg ggcacaagct gtgaggagaa 1200

cctggatgac tgtattgctg ccacctgtgc cccgggatcc acctgcattg accgggtggg 1260

ctctttctcc tgcctctgcc cacctggacg cacaggactc ctgtgccact tggaagacat 1320

gtgtctgagc cagccgtgcc atggggatgc ccaatgcagc accaaccccc tcacaggctc 1380

cacactctgc ctgtgtcagc ctggctattc ggggcccacc tgccaccagg acctggacga 1440

gtgtctgatg gcccagcaag gcccaagtcc ctgtgaacat ggcggttcct gcctcaacac 1500

tcctggctcc ttcaactgcc tctgtccacc tggctacaca ggctcccgtt gtgaggctga 1560

tcacaatgag tgcctctccc agccctgcca cccaggaagc acctgtctgg acctacttgc 1620

caccttccac tgcctctgcc cgccaggctt agaagggcag ctctgtgagg tggagaccaa 1680

cgagtgtgcc tcagctccct gcctgaacca cgcggattgc catgacctgc tcaacggctt 1740

ccagtgcatc tgcctgcctg gattctccgg cacccgatgt gaggaggata tcgatgagtg 1800

cagaagctct ccctgtgcca atggtgggca gtgccaggac cagcctggag ccttccactg 1860

caagtgtctc ccaggctttg aagggccacg ctgtcaaaca gaggtggatg agtgcctgag 1920

tgacccatgt cccgttggag ccagctgcct tgatcttcca ggagccttct tttgcctctg 1980

cccctctggt ttcacaggcc agctctgtga ggttcccctg tgtgctccca acctgtgcca 2040

gcccaagcag atatgtaagg accagaaaga caaggccaac tgcctctgtc ctgatggaag 2100

ccctggctgt gccccacctg aggacaactg cacctgccac cacgggcact gccagagatc 2160 

ctcatgtgtg tgtgacgtgg gttggacggg gccagagtgt gaggcagagc tagggggctg 2220 

catctctgca ccctgtgccc atggggggac ctgctacccc cagccctctg gctacaactg 2280 

cacctgccct acaggctaca caggacccac ctgtagtgag gagatgacag cttgtcactc 2340 

agggccatgt ctcaatggcg gctcctgcaa ccctagccct ggaggctact actgcacctg 2400 

ccctccaagc cacacagggc cccagtgcca aaccagcact gactactgtg tgtctgcccc 2460 

gtgcttcaat gggggtacct gtgtgaacag gcctggcacc ttctcctgcc tctgtgccat 2520 

gggcttccag ggcccgcgct gtgagggaaa gctccgcccc agctgtgcag acagcccctg 2580

taggaatagg gcaacctgcc aggacagccc tcagggtccc cgctgcctct gccccactgg 2640 

ctacaccgga ggcagctgcc agactctgat ggacttatgt gcccagaagc cctgcccacg 2700 

caattcccac tgcctccaga ctgggccctc cttccactgc ttgtgcctcc agggatggac 2760 

cgggcctctc tgcaaccttc cactgtcctc ctgccagaag gctgcactga gccaaggcat 2820 

agacgtctct tccctttgcc acaatggagg cctctgtgtc gacagcggcc cctcctattt 2880 

ctgccactgc ccccctggat tccaaggcag cctgtgccag gatcacgtga acccatgtga 2940 

gtccaggcct tgccagaacg gggccacctg catggcccag cccagtgggt atctctgcca 3000 

gtgtgcccca ggctacgatg gacagaactg ctcaaaggaa ctcgatgctt gtcagtccca 3060 

accctgtcac aaccatggaa cctgtactcc caaacctgga ggcttccact gtgcctgccc 3120 

tccaggcttt gtggggctac gctgtgaggg agacgtggac gagtgtctgg accagccctg 3180 

ccaccccaca ggcactgcag cctgccactc tctggccaat gccttctact gccagtgtct 3240 

gcctggacac acaggccagt ggtgtgaggt ggagatagac ccctgccaca gccaaccctg 3300 

ctttcatgga gggacctgtg aggccacagc aggatcaccc ctgggtttca tctgccactg 3360 

ccccaagggt tttgaaggcc ccacctgcag ccacagggcc ccttcctgcg gcttccatca 3420 

ctgccaccac ggaggcctgt gtctgccctc ccctaagcca ggcttcccac cacgctgtgc 3480 

ctgcctcagt ggctatgggg gtcctgactg cctgacccca ccagctccta aaggctgtgg 3540 

ccctccctcc ccatgcctat acaatggcag ctgctcagag accacgggct tggggggccc 3600 

aggctttcga tgctcctgcc ctcacagctc tccagggccc cggtgtcaga aacccggagc 3660 

caaggggtgt gagggcagaa gtggagatgg ggcctgcgat gctggctgca gtggcccggg 3720 

aggaaactgg gatggagggg actgctctct gggagtccca gacccctgga agggctgccc 3780 

ctcccactct cggtgctggc ttctcttccg ggacgggcag tgccacccac agtgtgactc 3840 

tgaagagtgt ctgtttgatg gctacgactg tgagacccct ccagcctgca ctccagccta 3900 

tgaccagtac tgccatgatc acttccacaa cgggcactgt gagaaaggct gcaacactgc 3960 

agagtgtggc tgggatggag gtgactgcag gcctgaagat ggggacccag agtgggggcc 4020 

ctccctggcc ctgctggtgg tactgagccc cccagcccta gaccagcagc tgtttgccct 4080

ggcccgggtg ctgtccctga ctctgagggt aggactctgg gtaaggaagg atcgtgatgg 4140

cagggacatg gtgtacccct atcctggggc ccgggctgaa gaaaagctag gaggaactcg 4200

ggaccccacc tatcaggaga gagcagcccc tcaaacacag cccctgggca aggagaccga 4260

ctccctcagt gctgggtttg tggtggtcat gggtgtggat ttgtcccgct gtggccctga 4320

ccacccggca tcccgctgtc cctgggaccc tgggcttcta ctccgcttcc ttgctgcgat 4380

ggctgcagtg ggagccctgg agcccctgct gcctggacca ctgctggctg tccaccctca 4440

tgcagggacc gcaccccctg ccaaccagct tccctggcct gtgctgtgct ccccagtggc 4500

cggggtgatt ctcctggccc taggggctct tctcgtcctc cagctcatcc ggcgtcgacg 4560

ccgagagcat ggagctctct ggctgccccc tggtttcact cgacggcctc ggactcagtc 4620

agctccccac cgacgccggc ccccactagg cgaggacagc attggtctca aggcactgaa 4680

gccaaaggca gaagttgatg aggatggagt tgtgatgtgc tcaggccctg aggagggaga 4740

ggaggtgggc caggctgaag aaacaggccc accctccacg tgccagctct ggtctctgag 4800

tggtggctgt ggggcgctcc ctcaggcagc catgctaact cctccccagg aatctgagat 4860

ggaagcccct gacctggaca cccgtggacc tgatggggtg acacccctga tgtcagcagt 4920

ttgctgtggg gaagtacagt ccgggacctt ccaaggggca tggttgggat gtcctgagcc 4980

ctgggaacct ctgctggatg gaggggcctg tccccaggct cacaccgtgg gcactgggga 5040

gacccccctg cacctggctg cccgattctc ccggccaacc gctgcccgcc gcctccttga 5100

ggctggagcc aaccccaacc agccagaccg ggcagggcgc acaccccttc atgctgctgt 5160

ggctgctgat gctcgggagg tctgccagct tctgctccgt agcagacaaa ctgcagtgga 5220

cgctcgcaca gaggacggga ccacaccctt gatgctggct gccaggctgg cggtggaaga 5280

cctggttgaa gaactgattg cagcccaagc agacgtgggg gccagagata aatgggggaa 5340

aactgcgctg cactgggctg ctgccgtgaa caacgcccga gccgcccgct cgcttctcca 5400

ggccggagcc gataaagatg cccaggacaa cagggagcag acgccgctat tcctggcggc 5460

gcgggaagga gcggtggaag tagcccagct actgctgggg ctgggggcag cccgagagct 5520

gcgggaccag gctgggctag cgccggcgga cgtcgctcac caacgtaacc actgggatct 5580

gctgacgctg ctggaagggg ctgggccacc agaggcccgt cacaaagcca cgccgggccg 5640

cgaggctggg cccttcccgc gcgcacggac ggtgtcagta agcgtgcccc cgcatggggg 5700

cggggctctg ccgcgctgcc ggacgctgtc agccggagca ggccctcgtg ggggcggagc 5760

ttgtctgcag gctcggactt ggtccgtaga cttggctgcg cgggggggcg gggcctattc 5820

tcattgccgg agcctctcgg gagtaggagc aggaggaggc ccgacccctc gcggccgtag 5880

gttttctgca ggcatgcgcg ggcctcggcc caaccctgcg ataatgcgag gaagatacgg 5940

agtggctgcc gggcgcggag gcagggtctc aacggatgac tggccctgtg attgggtggc 6000

cctgggagct tgcggttctg cctccaacat tccgatcccg cctccttgcc ttactccgtc 6060

cccggagcgg ggatcacctc aacttgactg tggtccccca gccctccaag aaatgcccat 6120

[missing] [missing] aaaaatagaa gaatacatgg tagggaggaa ttccaaaaat 6180

gattacccat taaaaggcag gctggaaggc cttcctggtt ttaagatgga tcccccaaaa 6240

tgaagggttg tgagtttagt ttctctccta aaatgaatgt atgcccacca gagcagacat 6300

cttccacgtg gagaagctgc agctctggaa agagggttta agatgctagg atgaggcagg 6360

cccagtcctc ctccagaaaa taagacaggc cacaggaggg cagagtggag tggaaatacc 6420

cctaagttgg aaccaagaat tgcaggcata tgggatgtaa gatgttcttt cctatatatg 6480

gtttccaaag ggtgccccta tgatccattg tccccactgc ccacaaatgg ctgacaaata 6540

tttattgggc acctactatg tgccaggcac tgtgtaggtg ctgaaaagtg gccaagggcc 6600

acccccgctg atgactcctt gcattccctc ccctcacaac aaagaactcc actgtgggga 6660

tgaagcgctt cttctagcca ctgctatcgc tatttaagaa ccctaaatct gtcacccata 6720

ataaagctga tttgaagtgt taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 6780

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 6836



<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Notch-4 



Met Gln Pro Pro Ser Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Cys
1 5 10 15

Val Ser Val Val Arg Pro Arg Gly Leu Leu Cys Gly Ser Phe Pro Glu
20 25 30

Pro Cys Ala Asn Gly Gly Thr Cys Leu Ser Leu Ser Leu Gly Gln Gly
35 40 45

Thr Cys Gln Cys Ala Pro Gly Phe Leu Gly Glu Thr Cys Gln Phe Pro
50 55 60

Asp Pro Cys Gln Asn Ala Gln Leu Cys Gln Asn Gly Gly Ser Cys Gln
65 70 75 80

Ala Leu Leu Pro Ala Pro Leu Gly Leu Pro Ser Ser Pro Ser Pro Leu
85 90 95

Thr Pro Ser Phe Leu Cys Thr Cys Leu Pro Gly Phe Thr Gly Glu Arg
100 105 110

Cys Gln Ala Lys Leu Glu Asp Pro Cys Pro Pro Ser Phe Cys Ser Lys
115 120 125

Arg Gly Arg Cys His Ile Gln Ala Ser Gly Arg Pro Gln Cys Ser Cys
130 135 140

Met Pro Gly Trp Thr Gly Glu Gln Cys Gln Leu Arg Asp Phe Cys Ser
145 150 155 160

Ala Asn Pro Cys Val Asn Gly Gly Val Cys Leu Ala Thr Tyr Pro Gln
165 170 175

Ile Gln Cys His Cys Pro Pro Gly Phe Glu Gly His Ala Cys Glu Arg
180 185 190

Asp Val Asn Glu Cys Phe Gln Asp Pro Gly Pro Cys Pro Lys Gly Thr
195 200 205

Ser Cys His Asn Thr Leu Gly Ser Phe Gln Cys Leu Cys Pro Val Gly
210 215 220

Gln Glu Gly Pro Arg Cys Glu Leu Arg Ala Gly Pro Cys Pro Pro Arg
225 230 235 240

Gly Cys Ser Asn Gly Gly Thr Cys Gln Leu Met Pro Glu Lys Asp Ser
245 250 255

Thr Phe His Leu Cys Leu Cys Pro Pro Gly Phe Ile Gly Pro Gly Cys
260 265 270

Glu Val Asn Pro Asp Asn Cys Val Ser His Gln Cys Gln Asn Gly Gly
275 280 285

Thr Cys Gln Asp Gly Leu Asp Thr Tyr Thr Cys Leu Cys Pro Glu Thr
290 295 300

Trp Thr Gly Trp Asp Cys Ser Glu Asp Val Asp Glu Cys Glu Ala Gln
305 310 315 320

Gly Pro Pro His Cys Arg Asn Gly Gly Thr Cys Gln Asn Ser Ala Gly
325 330 335

Ser Phe His Cys Val Cys Val Ser Gly Trp Gly Gly Thr Ser Cys Glu
340 345 350

Glu Asn Leu Asp Asp Cys Ile Ala Ala Thr Cys Ala Pro Gly Ser Thr
355 360 365

Cys Ile Asp Arg Val Gly Ser Phe Ser Cys Leu Cys Pro Pro Gly Arg
370 375 380

Thr Gly Leu Leu Cys His Leu Glu Asp Met Cys Leu Ser Gln Pro Cys
385 390 395 400

His Gly Asp Ala Gln Cys Ser Thr Asn Pro Leu Thr Gly Ser Thr Leu
405 410 415

Cys Leu Cys Gln Pro Gly Tyr Ser Gly Pro Thr Cys His Gln Asp Leu
420 425 430

Asp Glu Cys Leu Met Ala Gln Gln Gly Pro Ser Pro Cys Glu His Gly
435 440 445

Gly Ser Cys Leu Asn Thr Pro Gly Ser Phe Asn Cys Leu Cys Pro Pro
450 455 460

Gly Tyr Thr Gly Ser Arg Cys Glu Ala Asp His Asn Glu Cys Leu Ser
465 470 475 480

Gln Pro Cys His Pro Gly Ser Thr Cys Leu Asp Leu Leu Ala Thr Phe
485 490 495

His Cys Leu Cys Pro Pro Gly Leu Glu Gly Gln Leu Cys Glu Val Glu
500 505 510

Thr Asn Glu Cys Ala Ser Ala Pro Cys Leu Asn His Ala Asp Cys His
515 520 525

Asp Leu Leu Asn Gly Phe Gln Cys Ile Cys Leu Pro Gly Phe Ser Gly
530 535 540

Thr Arg Cys Glu Glu Asp Ile Asp Glu Cys Arg Ser Ser Pro Cys Ala
545 550 555 560

Asn Gly Gly Gln Cys Gln Asp Gln Pro Gly Ala Phe His Cys Lys Cys
565 570 575

Leu Pro Gly Phe Glu Gly Pro Arg Cys Gln Thr Glu Val Asp Glu Cys
580 585 590

Leu Ser Asp Pro Cys Pro Val Gly Ala Ser Cys Leu Asp Leu Pro Gly
595 600 605

Ala Phe Phe Cys Leu Cys Pro Ser Gly Phe Thr Gly Gln Leu Cys Glu
610 615 620

Val Pro Leu Cys Ala Pro Asn Leu Cys Gln Pro Lys Gln Ile Cys Lys
625 630 635 640

Asp Gln Lys Asp Lys Ala Asn Cys Leu Cys Pro Asp Gly Ser Pro Gly
645 650 655

Cys Ala Pro Pro Glu Asp Asn Cys Thr Cys His His Gly His Cys Gln
660 665 670




Arg Ser Ser Cys Val Cys Asp Val Gly Trp Thr Gly Pro Glu Cys Glu
675 680 685

Ala Glu Leu Gly Gly Cys Ile Ser Ala Pro Cys Ala His Gly Gly Thr
690 695 700

Cys Tyr Pro Gln Pro Ser Gly Tyr Asn Cys Thr Cys Pro Thr Gly Tyr
705 710 715 720

Thr Gly Pro Thr Cys Ser Glu Glu Met Thr Ala Cys His Ser Gly Pro
725 730 735

Cys Leu Asn Gly Gly Ser Cys Asn Pro Ser Pro Gly Gly Tyr Tyr Cys
740 745 750

Thr Cys Pro Pro Ser His Thr Gly Pro Gln Cys Gln Thr Ser Thr Asp
755 760 765

Tyr Cys Val Ser Ala Pro Cys Phe Asn Gly Gly Thr Cys Val Asn Arg
770 775 780

Pro Gly Thr Phe Ser Cys Leu Cys Ala Met Gly Phe Gln Gly Pro Arg
785 790 795 800

Cys Glu Gly Lys Leu Arg Pro Ser Cys Ala Asp Ser Pro Cys Arg Asn
805 810 815

Arg Ala Thr Cys Gln Asp Ser Pro Gln Gly Pro Arg Cys Leu Cys Pro
820 825 830

Thr Gly Tyr Thr Gly Gly Ser Cys Gln Thr Leu Met Asp Leu Cys Ala
835 840 845

Gln Lys Pro Cys Pro Arg Asn Ser His Cys Leu Gln Thr Gly Pro Ser
850 855 860

Phe His Cys Leu Cys Leu Gln Gly Trp Thr Gly Pro Leu Cys Asn Leu
865 870 875 880

Pro Leu Ser Ser Cys Gln Lys Ala Ala Leu Ser Gln Gly Ile Asp Val
885 890 895

Ser Ser Leu Cys His Asn Gly Gly Leu Cys Val Asp Ser Gly Pro Ser
900 905 910

Tyr Phe Cys His Cys Pro Pro Gly Phe Gln Gly Ser Leu Cys Gln Asp
915 920 925

His Val Asn Pro Cys Glu Ser Arg Pro Cys Gln Asn Gly Ala Thr Cys
930 935 940

Met Ala Gln Pro Ser Gly Tyr Leu Cys Gln Cys Ala Pro Gly Tyr Asp
945 950 955 960

Gly Gln Asn Cys Ser Lys Glu Leu Asp Ala Cys Gln Ser Gln Pro Cys
965 970 975

His Asn His Gly Thr Cys Thr Pro Lys Pro Gly Gly Phe His Cys Ala
980 985 990

Cys Pro Pro Gly Phe Val Gly Leu Arg Cys Glu Gly Asp Val Asp Glu
995 1000 1005

Cys Leu Asp Gln Pro Cys His Pro Thr Gly Thr Ala Ala Cys His
1010 1015 1020

Ser Leu Ala Asn Ala Phe Tyr Cys Gln Cys Leu Pro Gly His Thr
1025 1030 1035

Gly Gln Trp Cys Glu Val Glu Ile Asp Pro Cys His Ser Gln Pro
1040 1045 1050

Cys Phe His Gly Gly Thr Cys Glu Ala Thr Ala Gly Ser Pro Leu
1055 1060 1065

Gly Phe Ile Cys His Cys Pro Lys Gly Phe Glu Gly Pro Thr Cys
1070 1075 1080

Ser His Arg Ala Pro Ser Cys Gly Phe His His Cys His His Gly
1085 1090 1095

Gly Leu Cys Leu Pro Ser Pro Lys Pro Gly Phe Pro Pro Arg Cys
1100 1105 1110

Ala Cys Leu Ser Gly Tyr Gly Gly Pro Asp Cys Leu Thr Pro Pro
1115 1120 1125

Ala Pro Lys Gly Cys Gly Pro Pro Ser Pro Cys Leu Tyr Asn Gly
1130 1135 1140

Ser Cys Ser Glu Thr Thr Gly Leu Gly Gly Pro Gly Phe Arg Cys
1145 1150 1155

Ser Cys Pro His Ser Ser Pro Gly Pro Arg Cys Gln Lys Pro Gly
1160 1165 1170

Ala Lys Gly Cys Glu Gly Arg Ser Gly Asp Gly Ala Cys Asp Ala
1175 1180 1185

Gly Cys Ser Gly Pro Gly Gly Asn Trp Asp Gly Gly Asp Cys Ser
1190 1195 1200

Leu Gly Val Pro Asp Pro Trp Lys Gly Cys Pro Ser His Ser Arg
1205 1210 1215

Cys Trp Leu Leu Phe Arg Asp Gly Gln Cys His Pro Gln Cys Asp
1220 1225 1230

Ser Glu Glu Cys Leu Phe Asp Gly Tyr Asp Cys Glu Thr Pro Pro
1235 1240 1245

Ala Cys Thr Pro Ala Tyr Asp Gln Tyr Cys His Asp His Phe His
1250 1255 1260

Asn Gly His Cys Glu Lys Gly Cys Asn Thr Ala Glu Cys Gly Trp
1265 1270 1275

Asp Gly Gly Asp Cys Arg Pro Glu Asp Gly Asp Pro Glu Trp Gly
1280 1285 1290

Pro Ser Leu Ala Leu Leu Val Val Leu Ser Pro Pro Ala Leu Asp
1295 1300 1305

Gln Gln Leu Phe Ala Leu Ala Arg Val Leu Ser Leu Thr Leu Arg
1310 1315 1320

Val Gly Leu Trp Val Arg Lys Asp Arg Asp Gly Arg Asp Met Val
1325 1330 1335

Tyr Pro Tyr Pro Gly Ala Arg Ala Glu Glu Lys Leu Gly Gly Thr
1340 1345 1350

Arg Asp Pro Thr Tyr Gln Glu Arg Ala Ala Pro Gln Thr Gln Pro
1355 1360 1365

Leu Gly Lys Glu Thr Asp Ser Leu Ser Ala Gly Phe Val Val Val
1370 1375 1380

Met Gly Val Asp Leu Ser Arg Cys Gly Pro Asp His Pro Ala Ser
1385 1390 1395

Arg Cys Pro Trp Asp Pro Gly Leu Leu Leu Arg Phe Leu Ala Ala
1400 1405 1410

Met Ala Ala Val Gly Ala Leu Glu Pro Leu Leu Pro Gly Pro Leu
1415 1420 1425

Leu Ala Val His Pro His Ala Gly Thr Ala Pro Pro Ala Asn Gln
1430 1435 1440

Leu Pro Trp Pro Val Leu Cys Ser Pro Val Ala Gly Val Ile Leu
1445 1450 1455

Leu Ala Leu Gly Ala Leu Leu Val Leu Gln Leu Ile Arg Arg Arg
1460 1465 1470

Arg Arg Glu His Gly Ala Leu Trp Leu Pro Pro Gly Phe Thr Arg
1475 1480 1485

Arg Pro Arg Thr Gln Ser Ala Pro His Arg Arg Arg Pro Pro Leu
1490 1495 1500

Gly Glu Asp Ser Ile Gly Leu Lys Ala Leu Lys Pro Lys Ala Glu
1505 1510 1515

Val Asp Glu Asp Gly Val Val Met Cys Ser Gly Pro Glu Glu Gly
1520 1525 1530

Glu Glu Val Gly Gln Ala Glu Glu Thr Gly Pro Pro Ser Thr Cys
1535 1540 1545

Gln Leu Trp Ser Leu Ser Gly Gly Cys Gly Ala Leu Pro Gln Ala
1550 1555 1560

Ala Met Leu Thr Pro Pro Gln Glu Ser Glu Met Glu Ala Pro Asp
1565 1570 1575

Leu Asp Thr Arg Gly Pro Asp Gly Val Thr Pro Leu Met Ser Ala
1580 1585 1590

Val Cys Cys Gly Glu Val Gln Ser Gly Thr Phe Gln Gly Ala Trp
1595 1600 1605

Leu Gly Cys Pro Glu Pro Trp Glu Pro Leu Leu Asp Gly Gly Ala
1610 1615 1620

Cys Pro Gln Ala His Thr Val Gly Thr Gly Glu Thr Pro Leu His
1625 1630 1635

Leu Ala Ala Arg Phe Ser Arg Pro Thr Ala Ala Arg Arg Leu Leu
1640 1645 1650

Glu Ala Gly Ala Asn Pro Asn Gln Pro Asp Arg Ala Gly Arg Thr
1655 1660 1665

Pro Leu His Ala Ala Val Ala Ala Asp Ala Arg Glu Val Cys Gln
1670 1675 1680

Leu Leu Leu Arg Ser Arg Gln Thr Ala Val Asp Ala Arg Thr Glu
1685 1690 1695

Asp Gly Thr Thr Pro Leu Met Leu Ala Ala Arg Leu Ala Val Glu
1700 1705 1710




Asp Leu Val Glu Glu Leu Ile Ala Ala Gln Ala Asp Val Gly Ala
1715 1720 1725

Arg Asp Lys Trp Gly Lys Thr Ala Leu His Trp Ala Ala Ala Val
1730 1735 1740

Asn Asn Ala Arg Ala Ala Arg Ser Leu Leu Gln Ala Gly Ala Asp
1745 1750 1755

Lys Asp Ala Gln Asp Asn Arg Glu Gln Thr Pro Leu Phe Leu Ala
1760 1765 1770

Ala Arg Glu Gly Ala Val Glu Val Ala Gln Leu Leu Leu Gly Leu
1775 1780 1785

Gly Ala Ala Arg Glu Leu Arg Asp Gln Ala Gly Leu Ala Pro Ala
1790 1795 1800

Asp Val Ala His Gln Arg Asn His Trp Asp Leu Leu Thr Leu Leu
1805 1810 1815

Glu Gly Ala Gly Pro Pro Glu Ala Arg His Lys Ala Thr Pro Gly
1820 1825 1830

Arg Glu Ala Gly Pro Phe Pro Arg Ala Arg Thr Val Ser Val Ser
1835 1840 1845

Val Pro Pro His Gly Gly Gly Ala Leu Pro Arg Cys Arg Thr Leu
1850 1855 1860

Ser Ala Gly Ala Gly Pro Arg Gly Gly Gly Ala Cys Leu Gln Ala
1865 1870 1875

Arg Thr Trp Ser Val Asp Leu Ala Ala Arg Gly Gly Gly Ala Tyr
1880 1885 1890

Ser His Cys Arg Ser Leu Ser Gly Val Gly Ala Gly Gly Gly Pro
1895 1900 1905

Thr Pro Arg Gly Arg Arg Phe Ser Ala Gly Met Arg Gly Pro Arg
1910 1915 1920

Pro Asn Pro Ala Ile Met Arg Gly Arg Tyr Gly Val Ala Ala Gly
1925 1930 1935

Arg Gly Gly Arg Val Ser Thr Asp Asp Trp Pro Cys Asp Trp Val
1940 1945 1950

Ala Leu Gly Ala Cys Gly Ser Ala Ser Asn Ile Pro Ile Pro Pro
1955 1960 1965

Pro Cys Leu Thr Pro Ser Pro Glu Arg Gly Ser Pro Gln Leu Asp
1970 1975 1980

Cys Gly Pro Pro Ala Leu Gln Glu Met Pro Ile Asn Gln Gly Gly
1985 1990 1995

Glu Gly Lys Lys



<210> 22 

<211> 5896 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Jagged-1 












<400> 22 
ctgcggccgg cccgcgagct aggctgggtt tttttttttc tcccctccct cccccctttt 60

tccatgcagc tgatctaaaa gggaataaaa ggctgcgcat aatcataata ataaaagaag 120

gggagcgcga gagaaggaaa gaaagccggg aggtggaaga aaaaaaaaaa cgtctcaaag 180

aagcgatcag aataataaaa ggaggccggg ctctttgcct tctggaacgg gccgctcttg 240

aaagggcttt tgaaaagtgg tgttgttttc cagtcgtgca tgctccaatc ggcggagtat 300

attagagccg ggacgcggcg [missing] cagcggcgac ggcagcaccg gcggcagcac 360

cagcgcgaac agcagcggcg acatcccaaa tgcccgcggc gcgcggcgca gcgatgcgtt 420

ccccacggac gcgcggccgg tccgggcgcc ccctaagcct cctgctcgcc ctgctctgtg 480

ccctgcgagc caaggtgtgt [missing] gtcagttcga attaaaaatc ctgtccatgc 540

agaacgtgaa cggggagctg cagaacggga actgctgcgg cggcgcccgg aacccgggag 600

accgcaagtg cacccgcgac gagtgtgaca catacttcaa agtgtgcctc aaggagtatc 660

agtcccgcgt cacggccggg gggccctgca gcttcggctc agggtccacg cctgtcatcg 720

ggggcaacac cttcaacctc aaggccagcc gcggcaacga ccgcaaccgc atcgtgctgc 780

ctttcagttt cgcctggccg aggtcctata cgttgcttgt ggaggcgtgg gattccagta 840

atgacaccgt tcaacctgac agtattattg aaaaggcttc tcactcgggc atgatcaacc 900

ccagccggca gtggcagacg ctgaagcaga acacgggcgt tgcccacttt gagtatcaga 960

tccgcgtgac ctgtgatgac tactactatg gctttggctg caataagttc tgccgcccca 1020

gagatgactt ctttggacac tatgcctgtg accagaatgg caacaaaact tgcatggaag 1080

gctggatggg ccccgaatgt aacagagcta tttgccgaca aggctgcagt cctaagcatg 1140

ggtcttgcaa actcccaggt gactgcaggt gccagtatgg ctggcaaggc ctgtactgtg 1200

ataagtgcat cccacacccg ggatgcgtcc acggcatctg taatgagccc tggcagtgcc 1260

tctgtgagac caactggggc ggccagctct gtgacaaaga tctcaattac tgtgggactc 1320

atcagccgtg tctcaacggg ggaacttgta gcaacacagg ccctgacaaa tatcagtgtt 1380

cctgccctga ggggtattca ggacccaact gtgaaattgc tgagcacgcc tgcctctctg 1440 

atccctgtca caacagaggc agctgtaagg agacctccct gggctttgag tgtgagtgtt 1500 

ccccaggctg gaccggcccc acatgctcta caaacattga tgactgttct cctaataact 1560 

gttcccacgg gggcacctgc caggacctgg ttaacggatt taagtgtgtg tgccccccac 1620 

agtggactgg gaaaacgtgc cagttagatg caaatgaatg tgaggccaaa ccttgtgtaa 1680 

acgccaaatc ctgtaagaat ctcattgcca gctactactg cgactgtctt cccggctgga 1740 

tgggtcagaa ttgtgacata aatattaatg actgccttgg ccagtgtcag aatgacgcct 1800 

cctgtcggga tttggttaat ggttatcgct gtatctgtcc acctggctat gcaggcgatc 1860 

actgtgagag agacatcgat gaatgtgcca gcaacccctg tttgaatggg ggtcactgtc 1920 

agaatgaaat caacagattc cagtgtctgt gtcccactgg tttctctgga aacctctgtc 1980 

agctggacat cgattattgt gagcctaatc cctgccagaa cggtgcccag tgctacaacc 2040 

gtgccagtga ctatttctgc aagtgccccg aggactatga gggcaagaac tgctcacacc 2100 

tgaaagacca ctgccgcacg accccctgtg aagtgattga cagctgcaca gtggccatgg 2160 

cttccaacga cacacctgaa ggggtgcggt atatttcctc caacgtctgt ggtcctcacg 2220 

ggaagtgcaa gagtcagtcg ggaggcaaat tcacctgtga ctgtaacaaa ggcttcacgg 2280 

gaacatactg ccatgaaaat attaatgact gtgagagcaa cccttgtaga aacggtggca 2340 

cttgcatcga tggtgtcaac tcctacaagt gcatctgtag tgacggctgg gagggggcct 2400 

actgtgaaac caatattaat gactgcagcc agaacccctg ccacaatggg ggcacgtgtc 2460 

gcgacctggt caatgacttc tactgtgact gtaaaaatgg gtggaaagga aagacctgcc 2520 

actcacgtga cagtcagtgt gatgaggcca cgtgcaacaa cggtggcacc tgctatgatg 2580 

agggggatgc ttttaagtgc atgtgtcctg gcggctggga aggaacaacc tgtaacatag 2640 

cccgaaacag tagctgcctg cccaacccct gccataatgg gggcacatgt gtggtcaacg 2700 

gcgagtcctt tacgtgcgtc tgcaaggaag gctgggaggg gcccatctgt gctcagaata 2760 

ccaatgactg cagccctcat ccctgttaca acagcggcac ctgtgtggat ggagacaact 2820 

ggtaccggtg cgaatgtgcc ccgggttttg ctgggcccga ctgcagaata aacatcaatg 2880 

aatgccagtc ttcaccttgt gcctttggag cgacctgtgt ggatgagatc aatggctacc 2940 

ggtgtgtctg ccctccaggg cacagtggtg ccaagtgcca ggaagtttca gggagacctt 3000 

gcatcaccat ggggagtgtg ataccagatg gggccaaatg ggatgatgac tgtaatacct 3060 

gccagtgcct gaatggacgg atcgcctgct caaaggtctg gtgtggccct cgaccttgcc 3120 

tgctccacaa agggcacagc gagtgcccca gcgggcagag ctgcatcccc atcctggacg 3180 

accagtgctt cgtccacccc tgcactggtg tgggcgagtg tcggtcttcc agtctccagc 3240 

cggtgaagac aaagtgcacc tctgactcct attaccagga taactgtgcg aacatcacat 3300 

ttacctttaa caaggagatg atgtcaccag gtcttactac ggagcacatt tgcagtgaat 3360

tgaggaattt gaatattttg aagaatgttt ccgctgaata ttcaatctac atcgcttgcg 3420

agccttcccc ttcagcgaac aatgaaatac atgtggccat ttctgctgaa gatatacggg 3480

atgatgggaa cccgatcaag gaaatcactg acaaaataat cgatcttgtt agtaaacgtg 3540

atggaaacag ctcgctgatt gctgccgttg cagaagtaag agttcagagg cggcctctga 3600

agaacagaac agatttcctt gttcccttgc tgagctctgt cttaactgtg gcttggatct 3660

gttgcttggt gacggccttc tactggtgcc tgcggaagcg gcggaagccg ggcagccaca 3720

cacactcagc ctctgaggac aacaccacca acaacgtgcg ggagcagctg aaccagatca 3780

aaaaccccat tgagaaacat ggggccaaca cggtccccat caaggattac gagaacaaga 3840

actccaaaat gtctaaaata aggacacaca attctgaagt agaagaggac gacatggaca 3900

aacaccagca gaaagcccgg tttgccaagc agccggcgta tacgctggta gacagagaag 3960

agaagccccc caacggcacg ccgacaaaac acccaaactg gacaaacaaa caggacaaca 4020

gagacttgga aagtgcccag agcttaaacc gaatggagta catcgtatag cagaccgcgg 4080

gcactgccgc cgctaggtag agtctgaggg cttgtagttc tttaaactgt cgtgtcatac 4140

tcgagtctga ggccgttgct gacttagaat ccctgtgtta atttaagttt tgacaagctg 4200

gcttacactg gcaatggtag tttctgtggt tggctgggaa atcgagtgcc q c a."t c t c a. c 3. 4260

gctatgcaaa aagctagtca acagtaccct ggttgtgtgt ccccttgcag ccga.ca.cQQti 4320




ctcggatcag gctcccagga gcctgcccag ccccctggtc [missing] [missing] 4380

agatgtccta atggtgatgc agtcttagat catagtttta [missing] ttgactcttg 4440

agttgttttt gtatattggt tttatgatga cgtacaagta gttctgtatt [missing] 4500

tttgcagctc agaaccacag caacgatcac aaatgacttt [missing] 11111111113311119 4560

tatttttgtt gttgggggag gggagacttt gatgtcagca gttgctggta 33 3 tl 9 3-3. 9 3-3. 4620

tttaaagaaa aaaatgtcaa aagtagaact ttgtatagtt atgtaaataa ttctttttta 4680

ttaatcactg tgtatatttg atttattaac ttaataatca ag3gCCt1l33 [missing] 4740

ctttttattt atatgtatgt gtttagaatt gaaggttttt gatagcattg taagcgtatg 4800

gctttatttt tttgaactct tctcattact tgttgcctat 3 a. 9 C C 3.3. 3 3 "t t3399^-9^-^-^- 4860

gaaaatagtt tattttaaaa caataggatg ggcttctgtg CCC3Q33"t3C [missing] 4920

ttttttgtac gacgtcagat gtttaaaaca ccttctatag [missing] 33030911111111 4980

aaggactgac tgaggcagtt tgaggattag tttagaacag 9 "t "t 1 1 "t "t 1 9 "t ttgli'ttg't't't 5040

tttgtttttc tgctttagac ttgaaaagag acaggcaggt gatctgctgc agagcagtaa 5100

gggaacaagt tgagctatga cttaacatag ccaaaatgtg agtggttgaa [missing] 5160

aatatcaaat taattgtgtg aacttggaag cacaccaatc tgactttgta [missing] 5220

tcttttcacc attcgtacat aatactgaac cacttgtaga tttgattttt tttttaatct 5280

actgcattta gggagtattc taataagcta gttgaatact tgaaccataa aatgtccagt 5340

aagatcactg tttagatttg ccatagagta cactgcctgc cttaagtgag gaaatcaaag 5400

tgctattacg aagttcaaga tcaaaaaggc ttataaaaca gagtaatctt gttggttcac 5460

cattgagacc gtgaagatac tttgtattgt cctattagtg ttatatgaac atacaaatgc 5520 

atctttgatg tgttgttctt ggcaataaat tttgaaaagt aatatttatt aaattttttt 5580 

gtatgaaaac atggaacagt gtggctcttc tgagcttacg tagttctacc ggctttgccg 5640 

tgtgcttctg ccaccctgct gagtctgttc tggtaatcgg ggtataatag gctctgcctg 5700 

acagagggat ggaggaagaa ctgaaaggct tttcaaccac aaaactcatc tggagttctc 5760 

aaagacctgg ggctgctgtg aagctggaac tgcgggagcc ccatctaggg gagccttgat 5820 

tcccttgtta ttcaacagca agtgtgaata ctgcttgaat aaacaccact ggattaatgg 5880 

aaaaaaaaaa aaaaaa 5896 

<210> 23 

<211> 1218 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Jagged-1



<400> 23

Met Arg Ser Pro Arg Thr Arg Gly Arg Ser Gly Arg Pro Leu Ser Leu
1 5 10 15

Leu Leu Ala Leu Leu Cys Ala Leu Arg Ala Lys Val Cys Gly Ala Ser
20 25 30

Gly Gln Phe Glu Leu Glu Ile Leu Ser Met Gln Asn Val Asn Gly Glu
35 40 45

Leu Gln Asn Gly Asn Cys Cys Gly Gly Ala Arg Asn Pro Gly Asp Arg
50 55 60

Lys Cys Thr Arg Asp Glu Cys Asp Thr Tyr Phe Lys Val Cys Leu Lys
65 70 75 80

Glu Tyr Gln Ser Arg Val Thr Ala Gly Gly Pro Cys Ser Phe Gly Ser
85 90 95

Gly Ser Thr Pro Val Ile Gly Gly Asn Thr Phe Asn Leu Lys Ala Ser
100 105 110

Arg Gly Asn Asp Arg Asn Arg Ile Val Leu Pro Phe Ser Phe Ala Trp
115 120 125

Pro Arg Ser Tyr Thr Leu Leu Val Glu Ala Trp Asp Ser Ser Asn Asp
130 135 140

Thr Val Gln Pro Asp Ser Ile Ile Glu Lys Ala Ser His Ser Gly Met
145 150 155 160

Ile Asn Pro Ser Arg Gln Trp Gln Thr Leu Lys Gln Asn Thr Gly Val
165 170 175

Ala His Phe Glu Tyr Gln Ile Arg Val Thr Cys Asp Asp Tyr Tyr Tyr
180 185 190

Gly Phe Gly Cys Asn Lys Phe Cys Arg Pro Arg Asp Asp Phe Phe Gly
195 200 205

His Tyr Ala Cys Asp Gln Asn Gly Asn Lys Thr Cys Met Glu Gly Trp
210 215 220

Met Gly Pro Glu Cys Asn Arg Ala Ile Cys Arg Gln Gly Cys Ser Pro
225 230 235 240

Lys His Gly Ser Cys Lys Leu Pro Gly Asp Cys Arg Cys Gln Tyr Gly
245 250 255

Trp Gln Gly Leu Tyr Cys Asp Lys Cys Ile Pro His Pro Gly Cys Val
260 265 270

His Gly Ile Cys Asn Glu Pro Trp Gln Cys Leu Cys Glu Thr Asn Trp
275 280 285

Gly Gly Gln Leu Cys Asp Lys Asp Leu Asn Tyr Cys Gly Thr His Gln
290 295 300

Pro Cys Leu Asn Gly Gly Thr Cys Ser Asn Thr Gly Pro Asp Lys Tyr
305 310 315 320

Gln Cys Ser Cys Pro Glu Gly Tyr Ser Gly Pro Asn Cys Glu Ile Ala
325 330 335

Glu His Ala Cys Leu Ser Asp Pro Cys His Asn Arg Gly Ser Cys Lys
340 345 350

Glu Thr Ser Leu Gly Phe Glu Cys Glu Cys Ser Pro Gly Trp Thr Gly
355 360 365

Pro Thr Cys Ser Thr Asn Ile Asp Asp Cys Ser Pro Asn Asn Cys Ser
370 375 380

His Gly Gly Thr Cys Gln Asp Leu Val Asn Gly Phe Lys Cys Val Cys
385 390 395 400

Pro Pro Gln Trp Thr Gly Lys Thr Cys Gln Leu Asp Ala Asn Glu Cys
405 410 415

Glu Ala Lys Pro Cys Val Asn Ala Lys Ser Cys Lys Asn Leu Ile Ala
420 425 430

Ser Tyr Tyr Cys Asp Cys Leu Pro Gly Trp Met Gly Gln Asn Cys Asp
435 440 445

Ile Asn Ile Asn Asp Cys Leu Gly Gln Cys Gln Asn Asp Ala Ser Cys
450 455 460

Arg Asp Leu Val Asn Gly Tyr Arg Cys Ile Cys Pro Pro Gly Tyr Ala
465 470 475 480

Gly Asp His Cys Glu Arg Asp Ile Asp Glu Cys Ala Ser Asn Pro Cys
485 490 495

Leu Asn Gly Gly His Cys Gln Asn Glu Ile Asn Arg Phe Gln Cys Leu
500 505 510

Cys Pro Thr Gly Phe Ser Gly Asn Leu Cys Gln Leu Asp Ile Asp Tyr
515 520 525

Cys Glu Pro Asn Pro Cys Gln Asn Gly Ala Gln Cys Tyr Asn Arg Ala
530 535 540

Ser Asp Tyr Phe Cys Lys Cys Pro Glu Asp Tyr Glu Gly Lys Asn Cys
545 550 555 560

Ser His Leu Lys Asp His Cys Arg Thr Thr Pro Cys Glu Val Ile Asp
565 570 575

Ser Cys Thr Val Ala Met Ala Ser Asn Asp Thr Pro Glu Gly Val Arg
580 585 590

Tyr Ile Ser Ser Asn Val Cys Gly Pro His Gly Lys Cys Lys Ser Gln
595 600 605

Ser Gly Gly Lys Phe Thr Cys Asp Cys Asn Lys Gly Phe Thr Gly Thr
610 615 620

Tyr Cys His Glu Asn Ile Asn Asp Cys Glu Ser Asn Pro Cys Arg Asn
625 630 635 640

Gly Gly Thr Cys Ile Asp Gly Val Asn Ser Tyr Lys Cys Ile Cys Ser
645 650 655

Asp Gly Trp Glu Gly Ala Tyr Cys Glu Thr Asn Ile Asn Asp Cys Ser
660 665 670

Gln Asn Pro Cys His Asn Gly Gly Thr Cys Arg Asp Leu Val Asn Asp
675 680 685

Phe Tyr Cys Asp Cys Lys Asn Gly Trp Lys Gly Lys Thr Cys His Ser
690 695 700

Arg Asp Ser Gln Cys Asp Glu Ala Thr Cys Asn Asn Gly Gly Thr Cys
705 710 715 720

Tyr Asp Glu Gly Asp Ala Phe Lys Cys Met Cys Pro Gly Gly Trp Glu
725 730 735

Gly Thr Thr Cys Asn Ile Ala Arg Asn Ser Ser Cys Leu Pro Asn Pro
740 745 750

Cys His Asn Gly Gly Thr Cys Val Val Asn Gly Glu Ser Phe Thr Cys
755 760 765

Val Cys Lys Glu Gly Trp Glu Gly Pro Ile Cys Ala Gln Asn Thr Asn
770 775 780

Asp Cys Ser Pro His Pro Cys Tyr Asn Ser Gly Thr Cys Val Asp Gly
785 790 795 800

Asp Asn Trp Tyr Arg Cys Glu Cys Ala Pro Gly Phe Ala Gly Pro Asp
805 810 815

Cys Arg Ile Asn Ile Asn Glu Cys Gln Ser Ser Pro Cys Ala Phe Gly
820 825 830

Ala Thr Cys Val Asp Glu Ile Asn Gly Tyr Arg Cys Val Cys Pro Pro
835 840 845

Gly His Ser Gly Ala Lys Cys Gln Glu Val Ser Gly Arg Pro Cys Ile
850 855 860

Thr Met Gly Ser Val Ile Pro Asp Gly Ala Lys Trp Asp Asp Asp Cys
865 870 875 880

Asn Thr Cys Gln Cys Leu Asn Gly Arg Ile Ala Cys Ser Lys Val Trp
885 890 895

Cys Gly Pro Arg Pro Cys Leu Leu His Lys Gly His Ser Glu Cys Pro
900 905 910

Ser Gly Gln Ser Cys Ile Pro Ile Leu Asp Asp Gln Cys Phe Val His
915 920 925

Pro Cys Thr Gly Val Gly Glu Cys Arg Ser Ser Ser Leu Gln Pro Val
930 935 940

Lys Thr Lys Cys Thr Ser Asp Ser Tyr Tyr Gln Asp Asn Cys Ala Asn
945 950 955 960

Ile Thr Phe Thr Phe Asn Lys Glu Met Met Ser Pro Gly Leu Thr Thr
965 970 975

Glu His Ile Cys Ser Glu Leu Arg Asn Leu Asn Ile Leu Lys Asn Val
980 985 990

Ser Ala Glu Tyr Ser Ile Tyr Ile Ala Cys Glu Pro Ser Pro Ser Ala
995 1000 1005

Asn Asn Glu Ile His Val Ala Ile Ser Ala Glu Asp Ile Arg Asp
1010 1015 1020

Asp Gly Asn Pro Ile Lys Glu Ile Thr Asp Lys Ile Ile Asp Leu
1025 1030 1035

Val Ser Lys Arg Asp Gly Asn Ser Ser Leu Ile Ala Ala Val Ala
1040 1045 1050

Glu Val Arg Val Gln Arg Arg Pro Leu Lys Asn Arg Thr Asp Phe
1055 1060 1065

Leu Val Pro Leu Leu Ser Ser Val Leu Thr Val Ala Trp Ile Cys
1070 1075 1080

Cys Leu Val Thr Ala Phe Tyr Trp Cys Leu Arg Lys Arg Arg Lys
1085 1090 1095

Pro Gly Ser His Thr His Ser Ala Ser Glu Asp Asn Thr Thr Asn
1100 1105 1110

Asn Val Arg Glu Gln Leu Asn Gln Ile Lys Asn Pro Ile Glu Lys
1115 1120 1125

His Gly Ala Asn Thr Val Pro Ile Lys Asp Tyr Glu Asn Lys Asn
1130 1135 1140

Ser Lys Met Ser Lys Ile Arg Thr His Asn Ser Glu Val Glu Glu
1145 1150 1155

Asp Asp Met Asp Lys His Gln Gln Lys Ala Arg Phe Ala Lys Gln
1160 1165 1170

Pro Ala Tyr Thr Leu Val Asp Arg Glu Glu Lys Pro Pro Asn Gly
1175 1180 1185

Thr Pro Thr Lys His Pro Asn Trp Thr Asn Lys Gln Asp Asn Arg
1190 1195 1200

Asp Leu Glu Ser Ala Gln Ser Leu Asn Arg Met Glu Tyr Ile Val
1205 1210 1215

<210> 24
<211> 5077
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Jagged-2

<400> 24 

ctcatgcata tgcaggtgcg cgggtgacga atgggcgagc gagctgtcag tctcgttccg 60 

aacttgttgg ctgcggtgcc gggagcgcgg gcgcgcagag ccgaggccgg gacccgctgc 120 

cttcaccgcc gccgccgtcg ccgccgggtg ggagccgggc cgggcagccg gagcgcggcc 180 

gccagcgagc cggagctgcc gccgcccctg cacgcccgcc gcccaggccc gcgcgccgcg 240 

gcgctgcgct cgaccccgcc cgcgccgccg ccgccgccgc ctctgccgct gccgctgcct 300 

ctgcgggcgc tcggagggcg ggcgggcgct gggaggccgg cgcggcggct gggagccggg 360 

cgcgggcggc ggcggcgggg ccgggcgggc gggtcgcggg ggcaatgcgg gcgcagggcc 420 

gggggcgcct tccccggcgg ctgctgctgc tgctggcgct ctgggtgcag gcggcgcggc 480 

ccatgggcta tttcgagctg cagctgagcg cgctgcggaa cgtgaacggg gagctgctga 540 

gcggcgcctg ctgtgacggc gacggccgga caacgcgcgc ggggggctgc ggccacgacg 600 

agtgcgacac gtacgtgcgc gtgtgcctta aggagtacca ggccaaggtg acgcccacgg 660 

ggccctgcag ctacggccac ggcgccacgc ccgtgctggg cggcaactcc ttctacctgc 720 

cgccggcggg cgctgcgggg gaccgagcgc gggcgcgggc ccgggccggc ggcgaccagg 780 

acccgggcct cgtcgtcatc cccttccagt tcgcctggcc gcgctccttt accctcatcg 840 

tggaggcctg ggactgggac aacgatacca ccccgaatga ggagctgctg atcgagcgag 900 

tgtcgcatgc cggcatgatc aacccggagg accgctggaa gagcctgcac ttcagcggcc 960 

acgtggcgca cctggagctg cagatccgcg tgcgctgcga cgagaactac tacagcgcca 1020 

cttgcaacaa gttctgccgg ccccgcaacg actttttcgg ccactacacc tgcgaccagt 1080 

acggcaacaa ggcctgcatg gacggctgga tgggcaagga gtgcaaggaa gctgtgtgta 1140 

aacaagggtg taatttgctc cacgggggat gcaccgtgcc tggggagtgc aggtgcagct 1200 

acggctggca agggaggttc tgcgatgagt gtgtccccta ccccggctgc gtgcatggca 1260 

gttgtgtgga gccctggcag tgcaactgtg agaccaactg gggcggcctg ctctgtgaca 1320 

aagacctgaa ctactgtggc agccaccacc cctgcaccaa cggaggcacg tgcatcaacg 1380 

ccgagcctga ccagtaccgc tgcacctgcc ctgacggcta ctcgggcagg aactgtgaga 1440 

aggctgagca cgcctgcacc tccaacccgt gtgccaacgg gggctcttgc catgaggtgc 1500 

cgtccggctt cgaatgccac tgcccatcgg gctggagcgg gcccacctgt gcccttgaca 1560 

tcgatgagtg tgcttcgaac ccgtgtgcgg ccggtggcac ctgtgtggac caggtggacg 1620 

gctttgagtg catctgcccc gagcagtggg tgggggccac ctgccagctg gacgccaatg 1680 

agtgtgaagg gaagccatgc cttaacgctt tttcttgcaa aaacctgatt ggcggctatt 1740

actgtgattg catcccgggc tggaagggca tcaactgcca tatcaacgtc aacgactgtc 1800

gcgggcagtg tcagcatggg ggcacctgca aggacctggt gaacgggtac cagtgtgtgt 1860

gcccacgggg cttcggaggc cggcattgcg agctggaacg agacgagtgt gccagcagcc 1920

cctgccacag cggcggcctc tgcgaggacc tggccgacgg cttccactgc cactgccccc 1980

agggcttctc cgggcctctc tgtgaggtgg atgtcgacct ttgtgagcca agcccctgcc 2040

ggaacggcgc tcgctgctat aacctggagg gtgactatta ctgcgcctgc cctgatgact 2100

ttggtggcaa gaactgctcc gtgccccgcg agccgtgccc tggcggggcc tgcagagtga 2160

tcgatggctg cgggtcagac gcggggcctg ggatgcctgg cacagcagcc tccggcgtgt 2220

gtggccccca tggacgctgc gtcagccagc cagggggcaa cttttcctgc atctgtgaca 2280

gtggctttac tggcacctac tgccatgaga acattgacga ctgcctgggc cagccctgcc 2340

gcaatggggg cacatgcatc gatgaggtgg acgccttccg ctgcttctgc cccagcggct 2400

gggagggcga gctctgcgac accaatccca acgactgcct tcccgatccc tgccacagcc 2460

gcggccgctg ctacgacctg gtcaatgact tctactgtgc gtgcgacgac ggctggaagg 2520

gcaagacctg ccactcacgc gagttccagt gcgatgccta cacctgcagc aacggtggca 2580

cctgctacga cagcggcgac accttccgct gcgcctgccc ccccggctgg aagggcagca 2640

cctgcgccgt cgccaagaac agcagctgcc tgcccaaccc ctgtgtgaat ggtggcacct 2700

gcgtgggcag cggggcctcc ttctcctgca tctgccggga cggctgggag ggtcgtactt 2760

gcactcacaa taccaacgac tgcaaccctc tgccttgcta caatggtggc atctgtgttg 2820

acggcgtcaa ctggttccgc tgcgagtgtg cacctggctt cgcggggcct gactgccgca 2880

tcaacatcga cgagtgccag tcctcgccct gtgcctacgg ggccacgtgt gtggatgaga 2940

tcaacgggta tcgctgtagc tgcccacccg gccgagccgg cccccggtgc caggaagtga 3000

tcgggttcgg gagatcctgc tggtcccggg gcactccgtt cccacacgga agctcctggg 3060

tggaagactg caacagctgc cgctgcctgg atggccgccg tgactgcagc aaggtgtggt 3120

gcggatggaa gccttgtctg ctggccggcc agcccgaggc cctgagcgcc cagtgcccac 3180

tggggcaaag gtgcctggag aaggccccag gccagtgtct gcgaccaccc tgtgaggcct 3240

ggggggagtg cggcgcagaa gagccaccga gcaccccctg cctgccacgc tccggccacc 3300

tggacaataa ctgtgcccgc ctcaccttgc atttcaaccg tgaccacgtg ccccagggca 3360

ccacggtggg cgccatttgc tccgggatcc gctccctgcc agccacaagg gctgtggcac 3420

gggaccgcct gctggtgttg ctttgcgacc gggcgtcctc gggggccagt gccgtggagg 3480

tggccgtgtc cttcagccct gccagggacc tgcctgacag cagcctgatc cagggcgcgg 3540

cccacgccat cgtggccgcc atcacccagc gggggaacag ctcactgctc ctggctgtca 3600

ccgaggtcaa ggtggagacg gttgttacgg gcggctcttc cacaggtctg ctggtgcctg 3660

tgctgtgtgg tgccttcagc gtgctgtggc tggcgtgcgt ggtcctgtgc gtgtggtgga 3720

cacgcaagcg caggaaagag cgggagagga gccggctgcc gcgggaggag agcgccaaca 3780

accagtgggc cccgctcaac cccatccgca accccatcga gcggccgggg ggccacaagg 3840

acgtgctcta ccagtgcaag aacttcacgc cgccgccgcg cagggcggac gaggcgctgc 3900

ccgggccggc cggccacgcg gccgtcaggg aggatgagga ggacgaggat ctgggccgcg 3960

gtgaggagga ctccctggag gcggagaagt tcctctcaca caaattcacc aaagatcctg 4020

gccgctcgcc ggggaggccg gcccactggg cctcaggccc caaagtggac aaccgcgcgg 4080

tcaggagcat caatgaggcc cgctacgccg gcaaggagta ggggcggctg ccagctgggc 4140

cgggacccag ggccctcggt gggagccatg ccgtctgccg gacccggagg ccgaggccat 4200

gtgcatagtt tctttatttt gtgtaaaaaa accaccaaaa acaaaaacca aatgtttatt 4260

ttctacgttt ctttaacctt gtataaatta ttcagtaact gtcaggctga aaacaatgga 4320

gtattctcgg atagttgcta tttttgtaaa gtttccgtgc gtggcactcg ctgtatgaaa 4380

ggagagagca aagggtgtct gcgtcgtcac caaatcgtag cgtttgttac cagaggttgt 4440

gcactgttta cagaatcttc cttttattcc tcactcgggt ttctctgtgg ctccaggcca 4500

aagtgccggt gagacccatg gctgtgttgg tgtggcccat ggctgttggt gggacccgtg 4560

gctgatggtg tggcctgtgg ctgtcggtgg gactcgtggc tgtcaatggg acctgtggct 4620

gtcggtggga cctacggtgg tcggtgggac cctggttatt gatgtggccc tggctgccgg 4680

cacggcccgt ggctgttgac gcacctgtgg ttgttagtgg ggcctgaggt catcggcgtg 4740

gcccaaggcc ggcaggtcaa cctcgcgctt gctggccagt ccaccctgcc tgccgtctgt 4800

gcttcctcct gcccagaacg cccgctccag cgatctctcc actgtgcttt cagaagtgcc 4860

cttcctgctg cgcagttctc ccatcctggg acggcggcag tattgaagct cgtgacaagt 4920

gccttcacac agacccctcg caactgtcca cgcgtgccgt ggcaccaggc gctgcccacc 4980

tgccggcccc ggccgcccct cctcgtgaaa gtgcattttt gtaaatgtgt acatattaaa 5040

ggaagcactc tgtatatttg attgaataat gccacca 5077



<210> 25 

<211> 1238 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Jagged-2 



<400> 25

Met Arg Ala Gln Gly Arg Gly Arg Leu Pro Arg Arg Leu Leu Leu Leu
1 5 10 15

Leu Ala Leu Trp Val Gln Ala Ala Arg Pro Met Gly Tyr Phe Glu Leu
20 25 30

Gln Leu Ser Ala Leu Arg Asn Val Asn Gly Glu Leu Leu Ser Gly Ala
35 40 45

Cys Cys Asp Gly Asp Gly Arg Thr Thr Arg Ala Gly Gly Cys Gly His
50 55 60

Asp Glu Cys Asp Thr Tyr Val Arg Val Cys Leu Lys Glu Tyr Gln Ala
65 70 75 80

Lys Val Thr Pro Thr Gly Pro Cys Ser Tyr Gly His Gly Ala Thr Pro
85 90 95

Val Leu Gly Gly Asn Ser Phe Tyr Leu Pro Pro Ala Gly Ala Ala Gly
100 105 110

Asp Arg Ala Arg Ala Arg Ala Arg Ala Gly Gly Asp Gln Asp Pro Gly
115 120 125

Leu Val Val Ile Pro Phe Gln Phe Ala Trp Pro Arg Ser Phe Thr Leu
130 135 140

Ile Val Glu Ala Trp Asp Trp Asp Asn Asp Thr Thr Pro Asn Glu Glu
145 150 155 160

Leu Leu Ile Glu Arg Val Ser His Ala Gly Met Ile Asn Pro Glu Asp
165 170 175

Arg Trp Lys Ser Leu His Phe Ser Gly His Val Ala His Leu Glu Leu
180 185 190

Gln Ile Arg Val Arg Cys Asp Glu Asn Tyr Tyr Ser Ala Thr Cys Asn
195 200 205

Lys Phe Cys Arg Pro Arg Asn Asp Phe Phe Gly His Tyr Thr Cys Asp
210 215 220

Gln Tyr Gly Asn Lys Ala Cys Met Asp Gly Trp Met Gly Lys Glu Cys
225 230 235 240

Lys Glu Ala Val Cys Lys Gln Gly Cys Asn Leu Leu His Gly Gly Cys
245 250 255

Thr Val Pro Gly Glu Cys Arg Cys Ser Tyr Gly Trp Gln Gly Arg Phe
260 265 270

Cys Asp Glu Cys Val Pro Tyr Pro Gly Cys Val His Gly Ser Cys Val
275 280 285

Glu Pro Trp Gln Cys Asn Cys Glu Thr Asn Trp Gly Gly Leu Leu Cys
290 295 300

Asp Lys Asp Leu Asn Tyr Cys Gly Ser His His Pro Cys Thr Asn Gly
305 310 315 320

Gly Thr Cys Ile Asn Ala Glu Pro Asp Gln Tyr Arg Cys Thr Cys Pro
325 330 335

Asp Gly Tyr Ser Gly Arg Asn Cys Glu Lys Ala Glu His Ala Cys Thr
340 345 350

Ser Asn Pro Cys Ala Asn Gly Gly Ser Cys His Glu Val Pro Ser Gly
355 360 365

Phe Glu Cys His Cys Pro Ser Gly Trp Ser Gly Pro Thr Cys Ala Leu
370 375 380

Asp Ile Asp Glu Cys Ala Ser Asn Pro Cys Ala Ala Gly Gly Thr Cys
385 390 395 400

Val Asp Gln Val Asp Gly Phe Glu Cys Ile Cys Pro Glu Gln Trp Val
405 410 415

Gly Ala Thr Cys Gln Leu Asp Ala Asn Glu Cys Glu Gly Lys Pro Cys
420 425 430

Leu Asn Ala Phe Ser Cys Lys Asn Leu Ile Gly Gly Tyr Tyr Cys Asp
435 440 445

Cys Ile Pro Gly Trp Lys Gly Ile Asn Cys His Ile Asn Val Asn Asp
450 455 460

Cys Arg Gly Gln Cys Gln His Gly Gly Thr Cys Lys Asp Leu Val Asn
465 470 475 480

Gly Tyr Gln Cys Val Cys Pro Arg Gly Phe Gly Gly Arg His Cys Glu
485 490 495

Leu Glu Arg Asp Glu Cys Ala Ser Ser Pro Cys His Ser Gly Gly Leu
500 505 510

Cys Glu Asp Leu Ala Asp Gly Phe His Cys His Cys Pro Gln Gly Phe
515 520 525

Ser Gly Pro Leu Cys Glu Val Asp Val Asp Leu Cys Glu Pro Ser Pro
530 535 540

Cys Arg Asn Gly Ala Arg Cys Tyr Asn Leu Glu Gly Asp Tyr Tyr Cys
545 550 555 560

Ala Cys Pro Asp Asp Phe Gly Gly Lys Asn Cys Ser Val Pro Arg Glu
565 570 575

Pro Cys Pro Gly Gly Ala Cys Arg Val Ile Asp Gly Cys Gly Ser Asp
580 585 590

Ala Gly Pro Gly Met Pro Gly Thr Ala Ala Ser Gly Val Cys Gly Pro
595 600 605

His Gly Arg Cys Val Ser Gln Pro Gly Gly Asn Phe Ser Cys Ile Cys
610 615 620

Asp Ser Gly Phe Thr Gly Thr Tyr Cys His Glu Asn Ile Asp Asp Cys
625 630 635 640

Leu Gly Gln Pro Cys Arg Asn Gly Gly Thr Cys Ile Asp Glu Val Asp
645 650 655

Ala Phe Arg Cys Phe Cys Pro Ser Gly Trp Glu Gly Glu Leu Cys Asp
660 665 670

Thr Asn Pro Asn Asp Cys Leu Pro Asp Pro Cys His Ser Arg Gly Arg
675 680 685

Cys Tyr Asp Leu Val Asn Asp Phe Tyr Cys Ala Cys Asp Asp Gly Trp
690 695 700

Lys Gly Lys Thr Cys His Ser Arg Glu Phe Gln Cys Asp Ala Tyr Thr
705 710 715 720

Cys Ser Asn Gly Gly Thr Cys Tyr Asp Ser Gly Asp Thr Phe Arg Cys
725 730 735

Ala Cys Pro Pro Gly Trp Lys Gly Ser Thr Cys Ala Val Ala Lys Asn
740 745 750

Ser Ser Cys Leu Pro Asn Pro Cys Val Asn Gly Gly Thr Cys Val Gly
755 760 765

Ser Gly Ala Ser Phe Ser Cys Ile Cys Arg Asp Gly Trp Glu Gly Arg
770 775 780

Thr Cys Thr His Asn Thr Asn Asp Cys Asn Pro Leu Pro Cys Tyr Asn
785 790 795 800

Gly Gly Ile Cys Val Asp Gly Val Asn Trp Phe Arg Cys Glu Cys Ala
805 810 815

Pro Gly Phe Ala Gly Pro Asp Cys Arg Ile Asn Ile Asp Glu Cys Gln
820 825 830

Ser Ser Pro Cys Ala Tyr Gly Ala Thr Cys Val Asp Glu Ile Asn Gly
835 840 845

Tyr Arg Cys Ser Cys Pro Pro Gly Arg Ala Gly Pro Arg Cys Gln Glu
850 855 860

Val Ile Gly Phe Gly Arg Ser Cys Trp Ser Arg Gly Thr Pro Phe Pro
865 870 875 880

His Gly Ser Ser Trp Val Glu Asp Cys Asn Ser Cys Arg Cys Leu Asp
885 890 895

Gly Arg Arg Asp Cys Ser Lys Val Trp Cys Gly Trp Lys Pro Cys Leu
900 905 910

Leu Ala Gly Gln Pro Glu Ala Leu Ser Ala Gln Cys Pro Leu Gly Gln
915 920 925

Arg Cys Leu Glu Lys Ala Pro Gly Gln Cys Leu Arg Pro Pro Cys Glu
930 935 940

Ala Trp Gly Glu Cys Gly Ala Glu Glu Pro Pro Ser Thr Pro Cys Leu
945 950 955 960

Pro Arg Ser Gly His Leu Asp Asn Asn Cys Ala Arg Leu Thr Leu His
965 970 975

Phe Asn Arg Asp His Val Pro Gln Gly Thr Thr Val Gly Ala Ile Cys
980 985 990

Ser Gly Ile Arg Ser Leu Pro Ala Thr Arg Ala Val Ala Arg Asp Arg
995 1000 1005

Leu Leu Val Leu Leu Cys Asp Arg Ala Ser Ser Gly Ala Ser Ala
1010 1015 1020

Val Glu Val Ala Val Ser Phe Ser Pro Ala Arg Asp Leu Pro Asp
1025 1030 1035

Ser Ser Leu Ile Gln Gly Ala Ala His Ala Ile Val Ala Ala Ile
1040 1045 1050

Thr Gln Arg Gly Asn Ser Ser Leu Leu Leu Ala Val Thr Glu Val
1055 1060 1065

Lys Val Glu Thr Val Val Thr Gly Gly Ser Ser Thr Gly Leu Leu
1070 1075 1080

Val Pro Val Leu Cys Gly Ala Phe Ser Val Leu Trp Leu Ala Cys
1085 1090 1095

Val Val Leu Cys Val Trp Trp Thr Arg Lys Arg Arg Lys Glu Arg
1100 1105 1110

Glu Arg Ser Arg Leu Pro Arg Glu Glu Ser Ala Asn Asn Gln Trp
1115 1120 1125

Ala Pro Leu Asn Pro Ile Arg Asn Pro Ile Glu Arg Pro Gly Gly
1130 1135 1140

His Lys Asp Val Leu Tyr Gln Cys Lys Asn Phe Thr Pro Pro Pro
1145 1150 1155

Arg Arg Ala Asp Glu Ala Leu Pro Gly Pro Ala Gly His Ala Ala
1160 1165 1170

Val Arg Glu Asp Glu Glu Asp Glu Asp Leu Gly Arg Gly Glu Glu
1175 1180 1185

Asp Ser Leu Glu Ala Glu Lys Phe Leu Ser His Lys Phe Thr Lys
1190 1195 1200

Asp Pro Gly Arg Ser Pro Gly Arg Pro Ala His Trp Ala Ser Gly
1205 1210 1215

Pro Lys Val Asp Asn Arg Ala Val Arg Ser Ile Asn Glu Ala Arg
1220 1225 1230

Tyr Ala Gly Lys Glu
1235

<210> 26 

<211> 4963 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Jagged2, transcript variant 2




<400> 26

ctcatgcata tgcaggtgcg cgggtgacga atgggcgagc gagctgtcag tctcgttccg 60

aacttgttgg ctgcggtgcc gggagcgcgg gcgcgcagag ccgaggccgg gacccgctgc 120

cttcaccgcc gccgccgtcg ccgccgggtg ggagccgggc cgggcagccg gagcgcggcc 180

gccagcgagc cggagctgcc gccgcccctg cacgcccgcc gcccaggccc gcgcgccgcg 240

gcgctgcgct cgaccccgcc cgcgccgccg ccgccgccgc ctctgccgct gccgctgcct 300

ctgcgggcgc tcggagggcg ggcgggcgct gggaggccgg cgcggcggct gggagccggg 360

cgcgggcggc ggcggcgggg ccgggcgggc gggtcgcggg ggcaatgcgg gcgcagggcc 420

gggggcgcct tccccggcgg ctgctgctgc tgctggcgct ctgggtgcag gcggcgcggc 480

ccatgggcta tttcgagctg cagctgagcg cgctgcggaa cgtgaacggg gagctgctga 540

gcggcgcctg ctgtgacggc gacggccgga caacgcgcgc ggggggctgc ggccacgacg 600

agtgcgacac gtacgtgcgc gtgtgcctta aggagtacca ggccaaggtg acgcccacgg 660

ggccctgcag ctacggccac ggcgccacgc ccgtgctggg cggcaactcc ttctacctgc 720

cgccggcggg cgctgcgggg gaccgagcgc gggcgcgggc ccgggccggc ggcgaccagg 780

acccgggcct cgtcgtcatc cccttccagt tcgcctggcc gcgctccttt accctcatcg 840 

tggaggcctg ggactgggac aacgatacca ccccgaatga ggagctgctg atcgagcgag 900 

tgtcgcatgc cggcatgatc aacccggagg accgctggaa gagcctgcac ttcagcggcc 960 

acgtggcgca cctggagctg cagatccgcg tgcgctgcga cgagaactac tacagcgcca 1020 

cttgcaacaa gttctgccgg ccccgcaacg actttttcgg ccactacacc tgcgaccagt 1080 

acggcaacaa ggcctgcatg gacggctgga tgggcaagga gtgcaaggaa gctgtgtgta 1140 

aacaagggtg taatttgctc cacgggggat gcaccgtgcc tggggagtgc aggtgcagct 1200 

acggctggca agggaggttc tgcgatgagt gtgtccccta ccccggctgc gtgcatggca 1260 

gttgtgtgga gccctggcag tgcaactgtg agaccaactg gggcggcctg ctctgtgaca 1320 

aagacctgaa ctactgtggc agccaccacc cctgcaccaa cggaggcacg tgcatcaacg 1380 

ccgagcctga ccagtaccgc tgcacctgcc ctgacggcta ctcgggcagg aactgtgaga 1440 

aggctgagca cgcctgcacc tccaacccgt gtgccaacgg gggctcttgc catgaggtgc 1500 

cgtccggctt cgaatgccac tgcccatcgg gctggagcgg gcccacctgt gcccttgaca 1560 

tcgatgagtg tgcttcgaac ccgtgtgcgg ccggtggcac ctgtgtggac caggtggacg 1620 

gctttgagtg catctgcccc gagcagtggg tgggggccac ctgccagctg gacgtcaacg 1680 

actgtcgcgg gcagtgtcag catgggggca cctgcaagga cctggtgaac gggtaccagt 1740 

gtgtgtgccc acggggcttc ggaggccggc attgcgagct ggaacgagac gagtgtgcca 1800 

gcagcccctg ccacagcggc ggcctctgcg aggacctggc cgacggcttc cactgccact 1860 

gcccccaggg cttctccggg cctctctgtg aggtggatgt cgacctttgt gagccaagcc 1920 

cctgccggaa cggcgctcgc tgctataacc tggagggtga ctattactgc gcctgccctg 1980 

atgactttgg tggcaagaac tgctccgtgc cccgcgagcc gtgccctggc ggggcctgca 2040 

gagtgatcga tggctgcggg tcagacgcgg ggcctgggat gcctggcaca gcagcctccg 2100 

gcgtgtgtgg cccccatgga cgctgcgtca gccagccagg gggcaacttt tcctgcatct 2160 

gtgacagtgg ctttactggc acctactgcc atgagaacat tgacgactgc ctgggccagc 2220 

cctgccgcaa tgggggcaca tgcatcgatg aggtggacgc cttccgctgc ttctgcccca 2280 

gcggctggga gggcgagctc tgcgacacca atcccaacga ctgccttccc gatccctgcc 2340 

acagccgcgg ccgctgctac gacctggtca atgacttcta ctgtgcgtgc gacgacggct 2400 

ggaagggcaa gacctgccac tcacgcgagt tccagtgcga tgcctacacc tgcagcaacg 2460 

gtggcacctg ctacgacagc ggcgacacct tccgctgcgc ctgccccccc ggctggaagg 2520 

gcagcacctg cgccgtcgcc aagaacagca gctgcctgcc caacccctgt gtgaatggtg 2580 

gcacctgcgt gggcagcggg gcctccttct cctgcatctg ccgggacggc tgggagggtc 2640 

gtacttgcac tcacaatacc aacgactgca accctctgcc ttgctacaat ggtggcatct 2700 

gtgttgacgg cgtcaactgg ttccgctgcg agtgtgcacc tggcttcgcg gggcctgact 2760

gccgcatcaa catcgacgag tgccagtcct cgccctgtgc ctacggggcc acgtgtgtgg 2820

atgagatcaa cgggtatcgc tgtagctgcc cacccggccg agccggcccc cggtgccagg 2880

aagtgatcgg gttcgggaga tcctgctggt cccggggcac tccgttccca cacggaagct 2940

cctgggtgga agactgcaac agctgccgct gcctggatgg ccgccgtgac tgcagcaagg 3000

tgtggtgcgg atggaagcct tgtctgctgg ccggccagcc cgaggccctg agcgcccagt 3060

gcccactggg gcaaaggtgc ctggagaagg ccccaggcca gtgtctgcga ccaccctgtg 3120

aggcctgggg ggagtgcggc gcagaagagc caccgagcac cccctgcctg ccacgctccg 3180

gccacctgga caataactgt gcccgcctca ccttgcattt caaccgtgac cacgtgcccc 3240

agggcaccac ggtgggcgcc atttgctccg ggatccgctc cctgccagcc acaagggctg 3300

tggcacggga ccgcctgctg gtgttgcttt gcgaccgggc gtcctcgggg gccagtgccg 3360

tggaggtggc cgtgtccttc agccctgcca gggacctgcc tgacagcagc ctgatccagg 3420

gcgcggccca cgccatcgtg gccgccatca cccagcgggg gaacagctca ctgctcctgg 3480

ctgtcaccga ggtcaaggtg gagacggttg ttacgggcgg ctcttccaca ggtctgctgg 3540

tgcctgtgct gtgtggtgcc ttcagcgtgc tgtggctggc gtgcgtggtc ctgtgcgtgt 3600

ggtggacacg caagcgcagg aaagagcggg agaggagccg gctgccgcgg gaggagagcg 3660

ccaacaacca gtgggccccg ctcaacccca tccgcaaccc catcgagcgg ccggggggcc 3720

acaaggacgt gctctaccag tgcaagaact tcacgccgcc gccgcgcagg gcggacgagg 3780

cgctgcccgg gccggccggc cacgcggccg tcagggagga tgaggaggac gaggatctgg 3840

gccgcggtga ggaggactcc ctggaggcgg agaagttcct ctcacacaaa ttcaccaaag 3900

atcctggccg ctcgccgggg aggccggccc actgggcctc aggccccaaa gtggacaacc 3960

gcgcggtcag gagcatcaat gaggcccgct acgccggcaa ggagtagggg cggctgccag 4020

ctgggccggg acccagggcc ctcggtggga gccatgccgt ctgccggacc cggaggccga 4080

ggccatgtgc atagtttctt tattttgtgt aaaaaaacca ccaaaaacaa aaaccaaatg 4140

tttattttct acgtttcttt aaccttgtat aaattattca gtaactgtca ggctgaaaac 4200

aatggagtat tctcggatag ttgctatttt tgtaaagttt ccgtgcgtgg cactcgctgt 4260

atgaaaggag agagcaaagg gtgtctgcgt cgtcaccaaa tcgtagcgtt tgttaccaga 4320

ggttgtgcac tgtttacaga atcttccttt tattcctcac tcgggtttct ctgtggctcc 4380

aggccaaagt gccggtgaga cccatggctg tgttggtgtg gcccatggct gttggtggga 4440

cccgtggctg atggtgtggc ctgtggctgt cggtgggact cgtggctgtc aatgggacct 4500

gtggctgtcg gtgggaccta cggtggtcgg tgggaccctg gttattgatg tggccctggc 4560

tgccggcacg gcccgtggct gttgacgcac ctgtggttgt tagtggggcc tgaggtcatc 4620

ggcgtggccc aaggccggca ggtcaacctc gcgcttgctg gccagtccac cctgcctgcc 4680

gtctgtgctt cctcctgccc agaacgcccg ctccagcgat ctctccactg tgctttcaga 4740

agtgcccttc ctgctgcgca gttctcccat cctgggacgg cggcagtatt gaagctcgtg 4800

acaagtgcct tcacacagac ccctcgcaac tgtccacgcg tgccgtggca ccaggcgctg 4860

cccacctgcc ggccccggcc gcccctcctc gtgaaagtgc atttttgtaa atgtgtacat 4920 
attaaaggaa gcactctgta tatttgattg aataatgcca cca 4963 

<210> 27 

<211> 1200 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Jagged2, transcript variant 2 



<400> 27

Met Arg Ala Gln Gly Arg Gly Arg Leu Pro Arg Arg Leu Leu Leu Leu
1 5 10 15

Leu Ala Leu Trp Val Gln Ala Ala Arg Pro Met Gly Tyr Phe Glu Leu
20 25 30

Gln Leu Ser Ala Leu Arg Asn Val Asn Gly Glu Leu Leu Ser Gly Ala
35 40 45

Cys Cys Asp Gly Asp Gly Arg Thr Thr Arg Ala Gly Gly Cys Gly His
50 55 60

Asp Glu Cys Asp Thr Tyr Val Arg Val Cys Leu Lys Glu Tyr Gln Ala
65 70 75 80

Lys Val Thr Pro Thr Gly Pro Cys Ser Tyr Gly His Gly Ala Thr Pro
85 90 95

Val Leu Gly Gly Asn Ser Phe Tyr Leu Pro Pro Ala Gly Ala Ala Gly
100 105 110

Asp Arg Ala Arg Ala Arg Ala Arg Ala Gly Gly Asp Gln Asp Pro Gly
115 120 125

Leu Val Val Ile Pro Phe Gln Phe Ala Trp Pro Arg Ser Phe Thr Leu
130 135 140

Ile Val Glu Ala Trp Asp Trp Asp Asn Asp Thr Thr Pro Asn Glu Glu
145 150 155 160

Leu Leu Ile Glu Arg Val Ser His Ala Gly Met Ile Asn Pro Glu Asp
165 170 175

Arg Trp Lys Ser Leu His Phe Ser Gly His Val Ala His Leu Glu Leu
180 185 190

Gln Ile Arg Val Arg Cys Asp Glu Asn Tyr Tyr Ser Ala Thr Cys Asn
195 200 205

Lys Phe Cys Arg Pro Arg Asn Asp Phe Phe Gly His Tyr Thr Cys Asp
210 215 220

Gln Tyr Gly Asn Lys Ala Cys Met Asp Gly Trp Met Gly Lys Glu Cys
225 230 235 240

Lys Glu Ala Val Cys Lys Gln Gly Cys Asn Leu Leu His Gly Gly Cys
245 250 255

Thr Val Pro Gly Glu Cys Arg Cys Ser Tyr Gly Trp Gln Gly Arg Phe
260 265 270

Cys Asp Glu Cys Val Pro Tyr Pro Gly Cys Val His Gly Ser Cys Val
275 280 285

Glu Pro Trp Gln Cys Asn Cys Glu Thr Asn Trp Gly Gly Leu Leu Cys
290 295 300

Asp Lys Asp Leu Asn Tyr Cys Gly Ser His His Pro Cys Thr Asn Gly
305 310 315 320

Gly Thr Cys Ile Asn Ala Glu Pro Asp Gln Tyr Arg Cys Thr Cys Pro
325 330 335

Asp Gly Tyr Ser Gly Arg Asn Cys Glu Lys Ala Glu His Ala Cys Thr
340 345 350

Ser Asn Pro Cys Ala Asn Gly Gly Ser Cys His Glu Val Pro Ser Gly
355 360 365

Phe Glu Cys His Cys Pro Ser Gly Trp Ser Gly Pro Thr Cys Ala Leu
370 375 380

Asp Ile Asp Glu Cys Ala Ser Asn Pro Cys Ala Ala Gly Gly Thr Cys
385 390 395 400

Val Asp Gln Val Asp Gly Phe Glu Cys Ile Cys Pro Glu Gln Trp Val
405 410 415

Gly Ala Thr Cys Gln Leu Asp Val Asn Asp Cys Arg Gly Gln Cys Gln
420 425 430

His Gly Gly Thr Cys Lys Asp Leu Val Asn Gly Tyr Gln Cys Val Cys
435 440 445

Pro Arg Gly Phe Gly Gly Arg His Cys Glu Leu Glu Arg Asp Glu Cys
450 455 460

Ala Ser Ser Pro Cys His Ser Gly Gly Leu Cys Glu Asp Leu Ala Asp
465 470 475 480

Gly Phe His Cys His Cys Pro Gln Gly Phe Ser Gly Pro Leu Cys Glu 
485 490 495 

Val Asp Val Asp Leu Cys Glu Pro Ser Pro Cys Arg Asn Gly Ala Arg 
500 505 510 

Cys Tyr Asn Leu Glu Gly Asp Tyr Tyr Cys Ala Cys Pro Asp Asp Phe 
515 520 525 

Gly Gly Lys Asn Cys Ser Val Pro Arg Glu Pro Cys Pro Gly Gly Ala 
530 535 540 

Cys Arg Val Ile Asp Gly Cys Gly Ser Asp Ala Gly Pro Gly Met Pro 
545 550 555 560 

Gly Thr Ala Ala Ser Gly Val Cys Gly Pro His Gly Arg Cys Val Ser 
565 570 575 

Gln Pro Gly Gly Asn Phe Ser Cys Ile Cys Asp Ser Gly Phe Thr Gly 
580 585 590 

Thr Tyr Cys His Glu Asn Ile Asp Asp Cys Leu Gly Gln Pro Cys Arg 
595 600 605 

Asn Gly Gly Thr Cys Ile Asp Glu Val Asp Ala Phe Arg Cys Phe Cys 
610 615 620 

Pro Ser Gly Trp Glu Gly Glu Leu Cys Asp Thr Asn Pro Asn Asp Cys 
625 630 635 640 

Leu Pro Asp Pro Cys His Ser Arg Gly Arg Cys Tyr Asp Leu Val Asn 
645 650 655 

Asp Phe Tyr Cys Ala Cys Asp Asp Gly Trp Lys Gly Lys Thr Cys His 
660 665 670 

Ser Arg Glu Phe Gln Cys Asp Ala Tyr Thr Cys Ser Asn Gly Gly Thr 
675 680 685 

Cys Tyr Asp Ser Gly Asp Thr Phe Arg Cys Ala Cys Pro Pro Gly Trp 
690 695 700 

Lys Gly Ser Thr Cys Ala Val Ala Lys Asn Ser Ser Cys Leu Pro Asn 
705 710 715 720 

Pro Cys Val Asn Gly Gly Thr Cys Val Gly Ser Gly Ala Ser Phe Ser 
725 730 735 

Cys Ile Cys Arg Asp Gly Trp Glu Gly Arg Thr Cys Thr His Asn Thr 
740 745 750 



Asn Asp Cys Asn Pro Leu Pro Cys Tyr Asn Gly Gly Ile Cys Val Asp 
755 760 765 

Gly Val Asn Trp Phe Arg Cys Glu Cys Ala Pro Gly Phe Ala Gly Pro 
770 775 780 

Asp Cys Arg Ile Asn Ile Asp Glu Cys Gln Ser Ser Pro Cys Ala Tyr 
785 790 795 800 

Gly Ala Thr Cys Val Asp Glu Ile Asn Gly Tyr Arg Cys Ser Cys Pro 
805 810 815 

Pro Gly Arg Ala Gly Pro Arg Cys Gln Glu Val Ile Gly Phe Gly Arg 
820 825 830 

Ser Cys Trp Ser Arg Gly Thr Pro Phe Pro His Gly Ser Ser Trp Val 
835 840 845 

Glu Asp Cys Asn Ser Cys Arg Cys Leu Asp Gly Arg Arg Asp Cys Ser 
850 855 860 

Lys Val Trp Cys Gly Trp Lys Pro Cys Leu Leu Ala Gly Gln Pro Glu 
865 870 875 880 

Ala Leu Ser Ala Gln Cys Pro Leu Gly Gln Arg Cys Leu Glu Lys Ala 
885 890 895 

Pro Gly Gln Cys Leu Arg Pro Pro Cys Glu Ala Trp Gly Glu Cys Gly 
900 905 910 

Ala Glu Glu Pro Pro Ser Thr Pro Cys Leu Pro Arg Ser Gly His Leu 
915 920 925 

Asp Asn Asn Cys Ala Arg Leu Thr Leu His Phe Asn Arg Asp His Val 
930 935 940 

Pro Gln Gly Thr Thr Val Gly Ala Ile Cys Ser Gly Ile Arg Ser Leu 
945 950 955 960 

Pro Ala Thr Arg Ala Val Ala Arg Asp Arg Leu Leu Val Leu Leu Cys 
965 970 975 

Asp Arg Ala Ser Ser Gly Ala Ser Ala Val Glu Val Ala Val Ser Phe 
980 985 990 

Ser Pro Ala Arg Asp Leu Pro Asp Ser Ser Leu Ile Gln Gly Ala Ala 
995 1000 1005 

His Ala Ile Val Ala Ala Ile Thr Gln Arg Gly Asn Ser Ser Leu 
1010 1015 1020 

Leu Leu Ala Val Thr Glu Val Lys Val Glu Thr Val Val Thr Gly 
1025 1030 1035 

Gly Ser Ser Thr Gly Leu Leu Val Pro Val Leu Cys Gly Ala Phe 
1040 1045 1050 

Ser Val Leu Trp Leu Ala Cys Val Val Leu Cys Val Trp Trp Thr 
1055 1060 1065 

Arg Lys Arg Arg Lys Glu Arg Glu Arg Ser Arg Leu Pro Arg Glu 
1070 1075 1080 

Glu Ser Ala Asn Asn Gln Trp Ala Pro Leu Asn Pro Ile Arg Asn 
1085 1090 1095 

Pro Ile Glu Arg Pro Gly Gly His Lys Asp Val Leu Tyr Gln Cys 
1100 1105 1110 

Lys Asn Phe Thr Pro Pro Pro Arg Arg Ala Asp Glu Ala Leu Pro 
1115 1120 1125 

Gly Pro Ala Gly His Ala Ala Val Arg Glu Asp Glu Glu Asp Glu 
1130 1135 1140 

Asp Leu Gly Arg Gly Glu Glu Asp Ser Leu Glu Ala Glu Lys Phe 
1145 1150 1155 

Leu Ser His Lys Phe Thr Lys Asp Pro Gly Arg Ser Pro Gly Arg 
1160 1165 1170 

Pro Ala His Trp Ala Ser Gly Pro Lys Val Asp Asn Arg Ala Val 
1175 1180 1185 

Arg Ser Ile Asn Glu Ala Arg Tyr Ala Gly Lys Glu 
1190 1195 1200 



<210> 28 

<211> 3158 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Delta like 1 (Notch ligand) 

<400> 28 




aaaccggaac ggggcccaac ttctggggcc tggagaaggg aaacgaagtc ccccccggtt 60 

tcccgaggtt gcctttcctc gggcatcctt ggtttcggcg ggacttcgca gggcggatat 120 

aaagaacggc gcctttggga agaggcggag accggcttta aagaaagaag tcttggtcct 180 

gcggcttggg cgaggcaagg gcgaggcaag ggcgctttct gccgacgctc cccgtggccc 240 

tacgatcccc cgcgcgtccg ccgctgttct aaggagagaa gtgggggccc cccaggctcg 300 

cgcgtggagc gaagcagcat gggcagtcgg tgcgcgctgg ccctggcggt gctctcggcc 360 

ttgctgtgtc aggtctggag ctctggggtg ttcgaactga agctgcagga gttcgtcaac 420 

aagaaggggc tgctggggaa ccgcaactgc tgccgcgggg gcgcggggcc accgccgtgc 480 

gcctgccgga ccttcttccg cgtgtgcctc aagcactacc aggccagcgt gtcccccgag 540 

ccgccctgca cctacggcag cgccgtcacc cccgtgctgg gcgtcgactc cttcagtctg 600 

cccgacggcg ggggcgccga ctccgcgttc agcaacccca tccgcttccc cttcggcttc 660 

acctggccgg gcaccttctc tctgattatt gaagctctcc acacagattc tcctgatgac 720 

ctcgcaacag aaaacccaga aagactcatc agccgcctgg ccacccagag gcacctgacg 780 

gtgggcgagg agtggtccca ggacctgcac agcagcggcc gcacggacct caagtactcc 840 

taccgcttcg tgtgtgacga acactactac ggagagggct gctccgtttt ctgccgtccc 900 

cgggacgatg ccttcggcca cttcacctgt ggggagcgtg gggagaaagt gtgcaaccct 960 

ggctggaaag ggccctactg cacagagccg atctgcctgc ctggatgtga tgagcagcat 1020 

ggattttgtg acaaaccagg ggaatgcaag tgcagagtgg gctggcaggg ccggtactgt 1080 

gacgagtgta tccgctatcc aggctgtctc catggcacct gccagcagcc ctggcagtgc 1140 

aactgccagg aaggctgggg gggccttttc tgcaaccagg acctgaacta ctgcacacac 1200 

cataagccct gcaagaatgg agccacctgc accaacacgg gccaggggag ctacacttgc 1260 

tcttgccggc ctgggtacac aggtgccacc tgcgagctgg ggattgacga gtgtgacccc 1320 

agcccttgta agaacggagg gagctgcacg gatctcgaga acagctactc ctgtacctgc 1380 

ccacccggct tctacggcaa aatctgtgaa ttgagtgcca tgacctgtgc ggacggccct 1440 

tgctttaacg ggggtcggtg ctcagacagc cccgatggag ggtacagctg ccgctgcccc 1500 

gtgggctact ccggcttcaa ctgtgagaag aaaattgact actgcagctc ttcaccctgt 1560 

tctaatggtg ccaagtgtgt ggacctcggt gatgcctacc tgtgccgctg ccaggccggc 1620 

ttctcgggga ggcactgtga cgacaacgtg gacgactgcg cctcctcccc gtgcgccaac 1680 

gggggcacct gccgggatgg cgtgaacgac ttctcctgca cctgcccgcc tggctacacg 1740 

ggcaggaact gcagtgcccc cgtcagcagg tgcgagcacg caccctgcca caatggggcc 1800 

acctgccacc agaggggcca cggctatgtg tgcgaatgtg cccgaagcta cgggggtccc 1860 

aactgccagt tcctgctccc cgagctgccc ccgggcccag cggtggtgga cctcactgag 1920 

aagctagagg gccagggcgg gccattcccc tgggtggccg tgtgcgccgg ggtcatcctt 1980 

gtcctcatgc tgctgctggg ctgtgccgct gtggtggtct gcgtccggct gaggctgcag 2040 

aagcaccggc ccccagccga cccctgccgg ggggagacgg agaccatgaa caacctggcc 2100 

aactgccagc gtgagaagga catctcagtc agcatcatcg gggccacgca gatcaagaac 2160 

accaacaaga aggcggactt ccacggggac cacagcgccg acaagaatgg cttcaaggcc 2220 

cgctacccag cggtggacta taacctcgtg caggacctca agggtgacga caccgccgtc 2280 

agggacgcgc acagcaagcg tgacaccaag tgccagcccc agggctcctc aggggaggag 2340 

aaggggaccc cgaccacact caggggtgga gaagcatctg aaagaaaaag gccggactcg 2400 

ggctgttcaa cttcaaaaga caccaagtac cagtcggtgt acgtcatatc cgaggagaag 2460 

gatgagtgcg tcatagcaac tgaggtgtaa aatggaagtg agatggcaag actcccgttt 2520 

ctcttaaaat aagtaaaatt ccaaggatat atgccccaac gaatgctgct gaagaggagg 2580 

gaggcctcgt ggactgctgc tgagaaaccg agttcagacc gagcaggttc tcctcctgag 2640 

gtcctcgacg cctgccgaca gcctgtcgcg gcccggccgc ctgcggcact gccttccgtg 2700 

acgtcgccgt tgcactatgg acagttgctc ttaagagaat atatatttaa atgggtgaac 2760 

tgaattacgc ctaagaagca tgcactgcct gagtgtatat tttggattct tatgagccag 2820 

tcttttcttg aattagaaac acaaacactg cctttattgt cctttttgat acgaagatgt 2880 

gctttttcta gatggaaaag atgtgtgtta ttttttggat ttgtaaaaat atttttcatg 2940 

atatctgtaa agcttgagta ttttgtgatg ttcgtttttt ataatttaaa ttttggtaaa 3000 

tatgtacaaa ggcacttcgg gtctatgtga ctatattttt ttgtatataa atgtatttat 3060 

ggaatattgt gccaatgtta tttgagtttt ttactgtttt gttaatgaag aaattccttt 3120 

ttaaaatatt tttccaaaat aaattttatg aggaattc 3158 

<210> 29 

<211> 723 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Delta like 1 (Notch ligand) 

<400> 29 



Met Gly Ser Arg Cys Ala Leu Ala Leu Ala Val Leu Ser Ala Leu Leu 
1 5 10 15 

Cys Gln Val Trp Ser Ser Gly Val Phe Glu Leu Lys Leu Gln Glu Phe 
20 25 30 

Val Asn Lys Lys Gly Leu Leu Gly Asn Arg Asn Cys Cys Arg Gly Gly 
35 40 45 

Ala Gly Pro Pro Pro Cys Ala Cys Arg Thr Phe Phe Arg Val Cys Leu 
50 55 60 

Lys His Tyr Gln Ala Ser Val Ser Pro Glu Pro Pro Cys Thr Tyr Gly 
65 70 75 80 

Ser Ala Val Thr Pro Val Leu Gly Val Asp Ser Phe Ser Leu Pro Asp 
85 90 95 

Gly Gly Gly Ala Asp Ser Ala Phe Ser Asn Pro Ile Arg Phe Pro Phe 
100 105 110 

Gly Phe Thr Trp Pro Gly Thr Phe Ser Leu Ile Ile Glu Ala Leu His 
115 120 125 

Thr Asp Ser Pro Asp Asp Leu Ala Thr Glu Asn Pro Glu Arg Leu Ile 
130 135 140 

Ser Arg Leu Ala Thr Gln Arg His Leu Thr Val Gly Glu Glu Trp Ser 
145 150 155 160 

Gln Asp Leu His Ser Ser Gly Arg Thr Asp Leu Lys Tyr Ser Tyr Arg 
165 170 175 

Phe Val Cys Asp Glu His Tyr Tyr Gly Glu Gly Cys Ser Val Phe Cys 
180 185 190 

Arg Pro Arg Asp Asp Ala Phe Gly His Phe Thr Cys Gly Glu Arg Gly 
195 200 205 

Glu Lys Val Cys Asn Pro Gly Trp Lys Gly Pro Tyr Cys Thr Glu Pro 
210 215 220 

Ile Cys Leu Pro Gly Cys Asp Glu Gln His Gly Phe Cys Asp Lys Pro 
225 230 235 240 

Gly Glu Cys Lys Cys Arg Val Gly Trp Gln Gly Arg Tyr Cys Asp Glu 
245 250 255 

Cys Ile Arg Tyr Pro Gly Cys Leu His Gly Thr Cys Gln Gln Pro Trp 
260 265 270 

Gln Cys Asn Cys Gln Glu Gly Trp Gly Gly Leu Phe Cys Asn Gln Asp 
275 280 285 

Leu Asn Tyr Cys Thr His His Lys Pro Cys Lys Asn Gly Ala Thr Cys 
290 295 300 

Thr Asn Thr Gly Gln Gly Ser Tyr Thr Cys Ser Cys Arg Pro Gly Tyr 
305 310 315 320 

Thr Gly Ala Thr Cys Glu Leu Gly Ile Asp Glu Cys Asp Pro Ser Pro 
325 330 335 

Cys Lys Asn Gly Gly Ser Cys Thr Asp Leu Glu Asn Ser Tyr Ser Cys 
340 345 350 

Thr Cys Pro Pro Gly Phe Tyr Gly Lys Ile Cys Glu Leu Ser Ala Met 
355 360 365 

Thr Cys Ala Asp Gly Pro Cys Phe Asn Gly Gly Arg Cys Ser Asp Ser 
370 375 380 

Pro Asp Gly Gly Tyr Ser Cys Arg Cys Pro Val Gly Tyr Ser Gly Phe 
385 390 395 400 

Asn Cys Glu Lys Lys Ile Asp Tyr Cys Ser Ser Ser Pro Cys Ser Asn 
405 410 415 

Gly Ala Lys Cys Val Asp Leu Gly Asp Ala Tyr Leu Cys Arg Cys Gln 
420 425 430 

Ala Gly Phe Ser Gly Arg His Cys Asp Asp Asn Val Asp Asp Cys Ala 
435 440 445 

Ser Ser Pro Cys Ala Asn Gly Gly Thr Cys Arg Asp Gly Val Asn Asp 
450 455 460 

Phe Ser Cys Thr Cys Pro Pro Gly Tyr Thr Gly Arg Asn Cys Ser Ala 
465 470 475 480 

Pro Val Ser Arg Cys Glu His Ala Pro Cys His Asn Gly Ala Thr Cys 
485 490 495 

His Gln Arg Gly His Gly Tyr Val Cys Glu Cys Ala Arg Ser Tyr Gly 
500 505 510 

Gly Pro Asn Cys Gln Phe Leu Leu Pro Glu Leu Pro Pro Gly Pro Ala 
515 520 525 

Val Val Asp Leu Thr Glu Lys Leu Glu Gly Gln Gly Gly Pro Phe Pro 
530 535 540 

Trp Val Ala Val Cys Ala Gly Val Ile Leu Val Leu Met Leu Leu Leu 
545 550 555 560 

Gly Cys Ala Ala Val Val Val Cys Val Arg Leu Arg Leu Gln Lys His 
565 570 575 

Arg Pro Pro Ala Asp Pro Cys Arg Gly Glu Thr Glu Thr Met Asn Asn 
580 585 590 

Leu Ala Asn Cys Gln Arg Glu Lys Asp Ile Ser Val Ser Ile Ile Gly 
595 600 605 

Ala Thr Gln Ile Lys Asn Thr Asn Lys Lys Ala Asp Phe His Gly Asp 
610 615 620 

His Ser Ala Asp Lys Asn Gly Phe Lys Ala Arg Tyr Pro Ala Val Asp 
625 630 635 640 

Tyr Asn Leu Val Gln Asp Leu Lys Gly Asp Asp Thr Ala Val Arg Asp 
645 650 655 

Ala His Ser Lys Arg Asp Thr Lys Cys Gln Pro Gln Gly Ser Ser Gly 
660 665 670 

Glu Glu Lys Gly Thr Pro Thr Thr Leu Arg Gly Gly Glu Ala Ser Glu 
675 680 685 

Arg Lys Arg Pro Asp Ser Gly Cys Ser Thr Ser Lys Asp Thr Lys Tyr 
690 695 700 

Gln Ser Val Tyr Val Ile Ser Glu Glu Lys Asp Glu Cys Val Ile Ala 
705 710 715 720 

Thr Glu Val 



<210> 30 

<211> 1971 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Delta like 3 (Notch ligand) 








<400> 30 
gaaggccatg gtctccccac ggatgtccgg gctcctctcc cagactgtga tcctagcgct 60 

cattttcctc ccccagacac ggcccgctgg cgtcttcgag ctgcagatcc actctttcgg 120 

gccgggtcca ggccctgggg ccccgcggtc cccctgcagc gcccggctcc cctgccgcct 180 

cttcttcaga gtctgcctga agcctgggct ctcagaggag gccgccgagt ccccgtgcgc 240 

cctgggcgcg gcgctgagtg cgcgcggacc ggtctacacc gagcagcccg gagcgcccgc 300 

gcctgatctc ccactgcccg acgggctctt gcaggtgccc ttccgggacg cctggcctgg 360 

caccttctct ttcatcatcg aaacctggag agaggagtta ggagaccaga ttggagggcc 420 

cgcctggagc ctgctggcgc gcgtggctgg caggcggcgc ttggcagccg gaggcccgtg 480 

ggcccgggac attcagcgcg caggcgcctg ggagctgcgc ttctcgtacc gcgcgcgctg 540 

cgagccgcct gccgtcggga ccgcgtgcac gcgcctctgc cgtccgcgca gcgccccctc 600 

gcggtgcggt ccgggactgc gcccctgcgc accgctcgag gacgaatgtg aggcgccgct 660 

ggtgtgccga gcaggctgca gccctgagca tggcttctgt gaacagcccg gtgaatgccg 720 

atgcctagag ggctggactg gacccctctg cacggtccct gtctccacca gcagctgcct 780 

cagccccagg ggcccgtcct ctgctaccac cggatgcctt gtccctgggc ctgggccctg 840 

tgacgggaac ccgtgtgcca atggaggcag ctgtagtgag acacccaggt cctttgaatg 900 

cacctgcccg cgtgggttct acgggctgcg gtgtgaggtg agcggggtga catgtgcaga 960 

tggaccctgc ttcaacggcg gcttgtgtgt cgggggtgca gaccctgact ctgcctacat 1020 

ctgccactgc ccacctggtt tccaaggctc caactgtgag aagagggtgg accggtgcag 1080 

cctgcagcca tgccgcaatg gcggactctg cctggacctg ggccacgccc tgcgctgccg 1140 

ctgccgcgcc ggcttcgcgg gtcctcgctg cgagcacgac ctggacgact gcgcgggccg 1200 

cgcctgcgct aacggcggca cgtgtgtgga gggcggcggc gcgcaccgct gctcctgcgc 1260 

gctgggcttc ggcggccgcg actgccgcga gcgcgcggac ccgtgcgccg cgcgcccctg 1320 

tgctcacggc ggccgctgct acgcccactt ctccggcctc gtctgcgctt gcgctcccgg 1380 

ctacatggga gcgcggtgtg agttcccagt gcaccccgac ggcgcaagcg ccttgcccgc 1440 

ggccccgccg ggcctcaggc ccggggaccc tcagcgctac cttttgcctc cggctctggg 1500 

actgctcgtg gccgcgggcg tggccggcgc tgcgctcttg ctggtccacg tgcgccgccg 1560 

tggccactcc caggatgctg ggtctcgctt gctggctggg accccggagc cgtcagtcca 1620 

cgcactcccg gatgcactca acaacctaag gacgcaggag ggttccgggg atggtccgag 1680 

ctcgtccgta gattggaatc gccctgaaga tgtagaccct caagggattt atgtcatatc 1740 

tgctccttcc atctacgctc gggaggtagc gacgcccctt ttccccccgc tacacactgg 1800 

gcgcgctggg cagaggcagc acctgctttt tccctaccct tcctcgattc tgtccgtgaa 1860 

atgaattggg tagagtctct ggaaggtttt aagcccattt tcagttctaa cttactttca 1920 

tcctattttg catccctctt atcgttttga gctacctgcc atcttctctt t 1971 

<210> 31 
<211> 618 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Delta like 3 (Notch ligand) 

<400> 31 

Met Val Ser Pro Arg Met Ser Gly Leu Leu Ser Gln Thr Val Ile Leu 
1 5 10 15 

Ala Leu Ile Phe Leu Pro Gln Thr Arg Pro Ala Gly Val Phe Glu Leu 
20 25 30 

Gln Ile His Ser Phe Gly Pro Gly Pro Gly Pro Gly Ala Pro Arg Ser 
35 40 45 

Pro Cys Ser Ala Arg Leu Pro Cys Arg Leu Phe Phe Arg Val Cys Leu 
50 55 60 

Lys Pro Gly Leu Ser Glu Glu Ala Ala Glu Ser Pro Cys Ala Leu Gly 
65 70 75 80 

Ala Ala Leu Ser Ala Arg Gly Pro Val Tyr Thr Glu Gln Pro Gly Ala 
85 90 95 

Pro Ala Pro Asp Leu Pro Leu Pro Asp Gly Leu Leu Gln Val Pro Phe 
100 105 110 

Arg Asp Ala Trp Pro Gly Thr Phe Ser Phe Ile Ile Glu Thr Trp Arg 
115 120 125 

Glu Glu Leu Gly Asp Gln Ile Gly Gly Pro Ala Trp Ser Leu Leu Ala 
130 135 140 

Arg Val Ala Gly Arg Arg Arg Leu Ala Ala Gly Gly Pro Trp Ala Arg 
145 150 155 160 

Asp Ile Gln Arg Ala Gly Ala Trp Glu Leu Arg Phe Ser Tyr Arg Ala 
165 170 175 

Arg Cys Glu Pro Pro Ala Val Gly Thr Ala Cys Thr Arg Leu Cys Arg 
180 185 190 

Pro Arg Ser Ala Pro Ser Arg Cys Gly Pro Gly Leu Arg Pro Cys Ala 
195 200 205 

Pro Leu Glu Asp Glu Cys Glu Ala Pro Leu Val Cys Arg Ala Gly Cys 
210 215 220 

Ser Pro Glu His Gly Phe Cys Glu Gln Pro Gly Glu Cys Arg Cys Leu 
225 230 235 240 

Glu Gly Trp Thr Gly Pro Leu Cys Thr Val Pro Val Ser Thr Ser Ser 
245 250 255 

Cys Leu Ser Pro Arg Gly Pro Ser Ser Ala Thr Thr Gly Cys Leu Val 
260 265 270 

Pro Gly Pro Gly Pro Cys Asp Gly Asn Pro Cys Ala Asn Gly Gly Ser 
275 280 285 

Cys Ser Glu Thr Pro Arg Ser Phe Glu Cys Thr Cys Pro Arg Gly Phe 
290 295 300 

Tyr Gly Leu Arg Cys Glu Val Ser Gly Val Thr Cys Ala Asp Gly Pro 
305 310 315 320 

Cys Phe Asn Gly Gly Leu Cys Val Gly Gly Ala Asp Pro Asp Ser Ala 
325 330 335 

Tyr Ile Cys His Cys Pro Pro Gly Phe Gln Gly Ser Asn Cys Glu Lys 
340 345 350 

Arg Val Asp Arg Cys Ser Leu Gln Pro Cys Arg Asn Gly Gly Leu Cys 
355 360 365 

Leu Asp Leu Gly His Ala Leu Arg Cys Arg Cys Arg Ala Gly Phe Ala 
370 375 380 

Gly Pro Arg Cys Glu His Asp Leu Asp Asp Cys Ala Gly Arg Ala Cys 
385 390 395 400 

Ala Asn Gly Gly Thr Cys Val Glu Gly Gly Gly Ala His Arg Cys Ser 
405 410 415 

Cys Ala Leu Gly Phe Gly Gly Arg Asp Cys Arg Glu Arg Ala Asp Pro 
420 425 430 

Cys Ala Ala Arg Pro Cys Ala His Gly Gly Arg Cys Tyr Ala His Phe 
435 440 445 

Ser Gly Leu Val Cys Ala Cys Ala Pro Gly Tyr Met Gly Ala Arg Cys 
450 455 460 

Glu Phe Pro Val His Pro Asp Gly Ala Ser Ala Leu Pro Ala Ala Pro 
465 470 475 480 

Pro Gly Leu Arg Pro Gly Asp Pro Gln Arg Tyr Leu Leu Pro Pro Ala 
485 490 495 

Leu Gly Leu Leu Val Ala Ala Gly Val Ala Gly Ala Ala Leu Leu Leu 
500 505 510 

Val His Val Arg Arg Arg Gly His Ser Gln Asp Ala Gly Ser Arg Leu 
515 520 525 

Leu Ala Gly Thr Pro Glu Pro Ser Val His Ala Leu Pro Asp Ala Leu 
530 535 540 

Asn Asn Leu Arg Thr Gln Glu Gly Ser Gly Asp Gly Pro Ser Ser Ser 
545 550 555 560 

Val Asp Trp Asn Arg Pro Glu Asp Val Asp Pro Gln Gly Ile Tyr Val 
565 570 575 

Ile Ser Ala Pro Ser Ile Tyr Ala Arg Glu Val Ala Thr Pro Leu Phe 
580 585 590 

Pro Pro Leu His Thr Gly Arg Ala Gly Gln Arg Gln His Leu Leu Phe 
595 600 605 

Pro Tyr Pro Ser Ser Ile Leu Ser Val Lys 
610 615 

<210> 32 

<211> 3383 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Delta like 4 (Notch ligand) 

<400> 32 

gctgcgcgca ggccgggaac acgaggccaa gagccgcagc cccagccgcc ttggtgcagc 60 

gtacaccggc actagcccgc ttgcagcccc aggattagac agaagacgcg tcctcggcgc 120 

ggtcgccgcc cagccgtagt cacctggatt acctacagcg gcagctgcag cggagccagc 180 

gagaaggcca aaggggagca gcgtcccgag aggagcgcct cttttcaggg accccgccgg 240 

ctggcggacg cgcgggaaag cggcgtcgcg aacagagcca gattgagggc ccgcgggtgg 300 

agagagcgac gcccgagggg atggcggcag cgtcccggag cgcctctggc tgggcgctac 360 

tgctgctggt ggcactttgg cagcagcgcg cggccggctc cggcgtcttc cagctgcagc 420 

tgcaggagtt catcaacgag cgcggcgtac tggccagtgg gcggccttgc gagcccggct 480 

gccggacttt cttccgcgtc tgccttaagc acttccaggc ggtcgtctcg cccggaccct 540 

gcaccttcgg gaccgtctcc acgccggtat tgggcaccaa ctccttcgct gtccgggacg 600 

acagtagcgg cggggggcgc aaccctctcc aactgccctt caatttcacc tggccgggta 660 

ccttctcgct catcatcgaa gcttggcacg cgccaggaga cgacctgcgg ccagaggcct 720 

tgccaccaga tgcactcatc agcaagatcg ccatccaggg ctccctagct gtgggtcaga 780 

actggttatt ggatgagcaa accagcaccc tcacaaggct gcgctactct taccgggtca 840 

tctgcagtga caactactat ggagacaact gctcccgcct gtgcaagaag cgcaatgacc 900 

acttcggcca ctatgtgtgc cagccagatg gcaacttgtc ctgcctgccc ggttggactg 960 

gggaatattg ccaacagcct atctgtcttt cgggctgtca tgaacagaat ggctactgca 1020 

gcaagccagc agagtgcctc tgccgcccag gctggcaggg ccggctgtgt aacgaatgca 1080 

tcccccacaa tggctgtcgc cacggcacct gcagcactcc ctggcaatgt acttgtgatg 1140 

agggctgggg aggcctgttt tgtgaccaag atctcaacta ctgcacccac cactccccat 1200 

gcaagaatgg ggcaacgtgc tccaacagtg ggcagcgaag ctacacctgc acctgtcgcc 1260 

caggctacac tggtgtggac tgtgagctgg agctcagcga gtgtgacagc aacccctgtc 1320 

gcaatggagg cagctgtaag gaccaggagg atggctacca ctgcctgtgt cctccgggct 1380 

actatggcct gcattgtgaa cacagcacct tgagctgcgc cgactccccc tgcttcaatg 1440 

ggggctcctg ccgggagcgc aaccaggggg ccaactatgc ttgtgaatgt ccccccaact 1500 

tcaccggctc caactgcgag aagaaagtgg acaggtgcac cagcaacccc tgtgccaacg 1560 

ggggacagtg cctgaaccga ggtccaagcc gcatgtgccg ctgccgtcct ggattcacgg 1620 

gcacctactg tgaactccac gtcagcgact gtgcccgtaa cccttgcgcc cacggtggca 1680 

cttgccatga cctggagaat gggctcatgt gcacctgccc tgccggcttc tctggccgac 1740 

gctgtgaggt gcggacatcc atcgatgcct [illegible] tccctgcttc aacagggcca 1800 

cctgctacac cgacctctcc acagacacct ttgtgtgcaa ctgcccttat ggctttgtgg 1860 

gcagccgctg cgagttcccc gtgggcttgc cgcccagctt cccctgggtg gccgtctcgc 1920 

tgggtgtggg gctggcagtg ctgctggtac tgctgggcat ggtggcagtg gctgtgcggc 1980 

[illegible] tcgacggccg gacgacggca gcagggaagc catgaacaac ttgtcggact 2040 

tccagaagga caacctgatt cctgccgccc agcttaaaaa cacaaaccag aagaaggagc 2100 

tggaagtgga ctgtggcctg gacaagtcca actgtggcaa acagcaaaac cacacattgg 2160 

actataatct ggccccaggg cccctggggc gggggaccat gccaggaaag tttccccaca 2220 

gtgacaagag cttaggagag aaggcgccac tgcggttaca cagtgaaaag ccagagtgtc 2280 

ggatatcagc gatatgctcc cccagggact ccatgtacca gtctgtgtgt ttgatatcag 2340 

aggagaggaa tgaatgtgtc attgccacgg aggtataagg caggagccta cctggacatc 2400 

[illegible] cccgcggctg gaccttcctt ctgcattgtt tacattgcat [illegible] 2460 

acgtttttca tatgcaacgt gctgctctca [illegible] ggaatggcag [illegible] 2520 

[illegible] ttgccaagag atgcaatacc [illegible] [illegible] tgtctggcat 2580 

cagattggca gctgcaccaa ccagaggaac agaagagaag agagatgcca ctgggcactg 2640 

ccctgccagt agtggccttc agggggctcc [illegible] [illegible] [illegible] 2700 

agtggcagta gccccatggg gcccggagct gctgtggcct ccactggcat ccgtgtttcc 2760 

aaaagtgcct ttggcccagg ctccacggcg acagttgggc [illegible] [illegible] 2820 

ggggccaatg agggcagggc ctcctgtggg ctggaaaacc actgggtgcg tctcttgctg 2880 

gggtttgccc tggaggtgag gtgagtgctc gagggagggg agtgctttct gccccatgcc 2940 

[illegible] gtatgcaggc ctggctctct [illegible] ctttgggcaa gaatgtccgt 3000 

ctacccggct tccaccaccc tctggccctg ggcttctgta agcagacagg cagagggcct 3060 

gcccctccca ccagccaagg gtgccaggcc [illegible] [illegible] [illegible] 3120 

aattccactg agggggaaat caggtgctgc ggccgcctgg gccctttcct ccctcaagcc 3180 

catctccaca acctcgagcc tgggctctgg tccactactg ccccagacca ccctcaaagc 3240 

tggtcttcag aaatcaataa tatgagtttt tattttgttt tttttttttt ttttgtagtt 3300 

tattttggag tctagtattt caataattta agaatcagaa gcactgacct ttctacattt 3360 

tataacatta ttttgtatat aat 3383 



<210> 33 

<211> 685 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Delta like 4 (Notch ligand) 

<400> 33 

Met Ala Ala Ala Ser Arg Ser Ala Ser Gly Trp Ala Leu Leu Leu Leu 
1 5 10 15 

Val Ala Leu Trp Gln Gln Arg Ala Ala Gly Ser Gly Val Phe Gln Leu 
20 25 30 

Gln Leu Gln Glu Phe Ile Asn Glu Arg Gly Val Leu Ala Ser Gly Arg 
35 40 45 

Pro Cys Glu Pro Gly Cys Arg Thr Phe Phe Arg Val Cys Leu Lys His 
50 55 60 

Phe Gln Ala Val Val Ser Pro Gly Pro Cys Thr Phe Gly Thr Val Ser 
65 70 75 80 

Thr Pro Val Leu Gly Thr Asn Ser Phe Ala Val Arg Asp Asp Ser Ser 
85 90 95 

Gly Gly Gly Arg Asn Pro Leu Gln Leu Pro Phe Asn Phe Thr Trp Pro 
100 105 110 

Gly Thr Phe Ser Leu Ile Ile Glu Ala Trp His Ala Pro Gly Asp Asp 
115 120 125 

Leu Arg Pro Glu Ala Leu Pro Pro Asp Ala Leu Ile Ser Lys Ile Ala 
130 135 140 

Ile Gln Gly Ser Leu Ala Val Gly Gln Asn Trp Leu Leu Asp Glu Gln 
145 150 155 160 

Thr Ser Thr Leu Thr Arg Leu Arg Tyr Ser Tyr Arg Val Ile Cys Ser 
165 170 175 

Asp Asn Tyr Tyr Gly Asp Asn Cys Ser Arg Leu Cys Lys Lys Arg Asn 
180 185 190 

Asp His Phe Gly His Tyr Val Cys Gln Pro Asp Gly Asn Leu Ser Cys 
195 200 205 

Leu Pro Gly Trp Thr Gly Glu Tyr Cys Gln Gln Pro Ile Cys Leu Ser 
210 215 220 

Gly Cys His Glu Gln Asn Gly Tyr Cys Ser Lys Pro Ala Glu Cys Leu 
225 230 235 240 

Cys Arg Pro Gly Trp Gln Gly Arg Leu Cys Asn Glu Cys Ile Pro His 
245 250 255 

Asn Gly Cys Arg His Gly Thr Cys Ser Thr Pro Trp Gln Cys Thr Cys 
260 265 270 

Asp Glu Gly Trp Gly Gly Leu Phe Cys Asp Gln Asp Leu Asn Tyr Cys 
275 280 285 

Thr His His Ser Pro Cys Lys Asn Gly Ala Thr Cys Ser Asn Ser Gly 
290 295 300 

Gln Arg Ser Tyr Thr Cys Thr Cys Arg Pro Gly Tyr Thr Gly Val Asp 
305 310 315 320 

Cys Glu Leu Glu Leu Ser Glu Cys Asp Ser Asn Pro Cys Arg Asn Gly 
325 330 335 

Gly Ser Cys Lys Asp Gln Glu Asp Gly Tyr His Cys Leu Cys Pro Pro 
340 345 350 

Gly Tyr Tyr Gly Leu His Cys Glu His Ser Thr Leu Ser Cys Ala Asp 
355 360 365 

Ser Pro Cys Phe Asn Gly Gly Ser Cys Arg Glu Arg Asn Gln Gly Ala 
370 375 380 

Asn Tyr Ala Cys Glu Cys Pro Pro Asn Phe Thr Gly Ser Asn Cys Glu 
385 390 395 400 

Lys Lys Val Asp Arg Cys Thr Ser Asn Pro Cys Ala Asn Gly Gly Gln 
405 410 415 

Cys Leu Asn Arg Gly Pro Ser Arg Met Cys Arg Cys Arg Pro Gly Phe 
420 425 430 

Thr Gly Thr Tyr Cys Glu Leu His Val Ser Asp Cys Ala Arg Asn Pro 
435 440 445 

Cys Ala His Gly Gly Thr Cys His Asp Leu Glu Asn Gly Leu Met Cys 
450 455 460 

Thr Cys Pro Ala Gly Phe Ser Gly Arg Arg Cys Glu Val Arg Thr Ser 
465 470 475 480 

Ile Asp Ala Cys Ala Ser Ser Pro Cys Phe Asn Arg Ala Thr Cys Tyr 
485 490 495 

Thr Asp Leu Ser Thr Asp Thr Phe Val Cys Asn Cys Pro Tyr Gly Phe 
500 505 510 

Val Gly Ser Arg Cys Glu Phe Pro Val Gly Leu Pro Pro Ser Phe Pro 
515 520 525 

Trp Val Ala Val Ser Leu Gly Val Gly Leu Ala Val Leu Leu Val Leu 
530 535 540 

Leu Gly Met Val Ala Val Ala Val Arg Gln Leu Arg Leu Arg Arg Pro 
545 550 555 560 

Asp Asp Gly Ser Arg Glu Ala Met Asn Asn Leu Ser Asp Phe Gln Lys 
565 570 575 

Asp Asn Leu Ile Pro Ala Ala Gln Leu Lys Asn Thr Asn Gln Lys Lys 
580 585 590 

Glu Leu Glu Val Asp Cys Gly Leu Asp Lys Ser Asn Cys Gly Lys Gln 
595 600 605 

Gln Asn His Thr Leu Asp Tyr Asn Leu Ala Pro Gly Pro Leu Gly Arg 
610 615 620 

Gly Thr Met Pro Gly Lys Phe Pro His Ser Asp Lys Ser Leu Gly Glu 
625 630 635 640 

Lys Ala Pro Leu Arg Leu His Ser Glu Lys Pro Glu Cys Arg Ile Ser 
645 650 655 

Ala Ile Cys Ser Pro Arg Asp Ser Met Tyr Gln Ser Val Cys Leu Ile 
660 665 670 

Ser Glu Glu Arg Asn Glu Cys Val Ile Ala Thr Glu Val 
675 680 685 

<210> 34 

<211> 5077 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Jagged2, transcript variant 1 






<400> 34 
ctcatgcata tgcaggtgcg cgggtgacga atgggcgagc gagctgtcag tctcgttccg 60 

aacttgttgg ctgcggtgcc gggagcgcgg gcgcgcagag ccgaggccgg gacccgctgc 120 

cttcaccgcc gccgccgtcg ccgccgggtg ggagccgggc cgggcagccg gagcgcggcc 180 

gccagcgagc cggagctgcc gccgcccctg cacgcccgcc gcccaggccc gcgcgccgcg 240 

gcgctgcgct cgaccccgcc cgcgccgccg ccgccgccgc ctctgccgct gccgctgcct 300 

ctgcgggcgc tcggagggcg ggcgggcgct gggaggccgg cgcggcggct gggagccggg 360 

cgcgggcggc ggcggcgggg ccgggcgggc gggtcgcggg ggcaatgcgg gcgcagggcc 420 

gggggcgcct tccccggcgg ctgctgctgc tgctggcgct ctgggtgcag gcggcgcggc 480 

ccatgggcta tttcgagctg cagctgagcg cgctgcggaa cgtgaacggg gagctgctga 540 

gcggcgcctg ctgtgacggc gacggccgga caacgcgcgc ggggggctgc ggccacgacg 600 

agtgcgacac gtacgtgcgc gtgtgcctta aggagtacca ggccaaggtg acgcccacgg 660 

ggccctgcag ctacggccac ggcgccacgc ccgtgctggg cggcaactcc ttctacctgc 720 

cgccggcggg cgctgcgggg gaccgagcgc gggcgcgggc ccgggccggc ggcgaccagg 780 

acccgggcct cgtcgtcatc cccttccagt tcgcctggcc gcgctccttt accctcatcg 840 

tggaggcctg ggactgggac aacgatacca ccccgaatga ggagctgctg atcgagcgag 900 

tgtcgcatgc cggcatgatc aacccggagg accgctggaa gagcctgcac ttcagcggcc 960 

acgtggcgca cctggagctg cagatccgcg tgcgctgcga cgagaactac tacagcgcca 1020 

cttgcaacaa gttctgccgg ccccgcaacg actttttcgg ccactacacc tgcgaccagt 1080 

acggcaacaa ggcctgcatg gacggctgga tgggcaagga gtgcaaggaa gctgtgtgta 1140 

aacaagggtg taatttgctc cacgggggat gcaccgtgcc tggggagtgc aggtgcagct 1200 

acggctggca agggaggttc tgcgatgagt gtgtccccta ccccggctgc gtgcatggca 1260 

gttgtgtgga gccctggcag tgcaactgtg agaccaactg gggcggcctg ctctgtgaca 1320 

aagacctgaa ctactgtggc agccaccacc cctgcaccaa cggaggcacg tgcatcaacg 1380 

ccgagcctga ccagtaccgc tgcacctgcc ctgacggcta ctcgggcagg aactgtgaga 1440 

aggctgagca cgcctgcacc tccaacccgt gtgccaacgg gggctcttgc catgaggtgc 1500 

cgtccggctt cgaatgccac tgcccatcgg gctggagcgg gcccacctgt gcccttgaca 1560 

tcgatgagtg tgcttcgaac ccgtgtgcgg ccggtggcac ctgtgtggac caggtggacg 1620 

gctttgagtg catctgcccc gagcagtggg tgggggccac ctgccagctg gacgccaatg 1680 

agtgtgaagg gaagccatgc cttaacgctt tttcttgcaa aaacctgatt ggcggctatt 1740 

actgtgattg catcccgggc tggaagggca tcaactgcca tatcaacgtc aacgactgtc 1800 

gcgggcagtg tcagcatggg ggcacctgca aggacctggt gaacgggtac cagtgtgtgt 1860 

gcccacgggg cttcggaggc cggcattgcg agctggaacg agacgagtgt gccagcagcc 1920 

cctgccacag cggcggcctc tgcgaggacc tggccgacgg cttccactgc cactgccccc 1980 

agggcttctc cgggcctctc tgtgaggtgg atgtcgacct ttgtgagcca agcccctgcc 2040 

ggaacggcgc tcgctgctat aacctggagg gtgactatta ctgcgcctgc cctgatgact 2100 

ttggtggcaa gaactgctcc gtgccccgcg agccgtgccc tggcggggcc tgcagagtga 2160 

tcgatggctg cgggtcagac gcggggcctg ggatgcctgg cacagcagcc tccggcgtgt 2220 

gtggccccca tggacgctgc gtcagccagc cagggggcaa cttttcctgc atctgtgaca 2280 

gtggctttac tggcacctac tgccatgaga acattgacga ctgcctgggc cagccctgcc 2340 

gcaatggggg cacatgcatc gatgaggtgg acgccttccg ctgcttctgc cccagcggct 2400 

gggagggcga gctctgcgac accaatccca acgactgcct tcccgatccc tgccacagcc 2460 

gcggccgctg ctacgacctg gtcaatgact tctactgtgc gtgcgacgac ggctggaagg 2520 

gcaagacctg ccactcacgc gagttccagt gcgatgccta cacctgcagc aacggtggca 2580 

cctgctacga cagcggcgac accttccgct gcgcctgccc ccccggctgg aagggcagca 2640 






cctgcgccgt cgccaagaac agcagctgcc tgcccaaccc ctgtgtgaat ggtggcacct 2700 

gcgtgggcag cggggcctcc ttctcctgca tctgccggga cggctgggag ggtcgtactt 2760 

gcactcacaa taccaacgac tgcaaccctc tgccttgcta caatggtggc atctgtgttg 2820 

acggcgtcaa ctggttccgc tgcgagtgtg cacctggctt cgcggggcct gactgccgca 2880 

tcaacatcga cgagtgccag tcctcgccct gtgcctacgg ggccacgtgt gtggatgaga 2940 

tcaacgggta tcgctgtagc tgcccacccg gccgagccgg cccccggtgc caggaagtga 3000 

tcgggttcgg gagatcctgc tggtcccggg gcactccgtt cccacacgga agctcctggg 3060 

tggaagactg caacagctgc cgctgcctgg atggccgccg tgactgcagc aaggtgtggt 3120 

gcggatggaa gccttgtctg ctggccggcc agcccgaggc cctgagcgcc cagtgcccac 3180 

tggggcaaag gtgcctggag aaggccccag gccagtgtct gcgaccaccc tgtgaggcct 3240 

ggggggagtg cggcgcagaa gagccaccga gcaccccctg cctgccacgc tccggccacc 3300 

tggacaataa ctgtgcccgc ctcaccttgc atttcaaccg tgaccacgtg ccccagggca 3360 

ccacggtggg cgccatttgc tccgggatcc gctccctgcc agccacaagg gctgtggcac 3420 

gggaccgcct gctggtgttg ctttgcgacc gggcgtcctc gggggccagt gccgtggagg 3480 

tggccgtgtc cttcagccct gccagggacc tgcctgacag cagcctgatc cagggcgcgg 3540 

cccacgccat cgtggccgcc atcacccagc gggggaacag ctcactgctc ctggctgtca 3600 

ccgaggtcaa ggtggagacg gttgttacgg gcggctcttc cacaggtctg ctggtgcctg 3660 

tgctgtgtgg tgccttcagc gtgctgtggc tggcgtgcgt ggtcctgtgc gtgtggtgga 3720 

cacgcaagcg caggaaagag cgggagagga gccggctgcc gcgggaggag agcgccaaca 3780 

accagtgggc cccgctcaac cccatccgca accccatcga gcggccgggg ggccacaagg 3840 

acgtgctcta ccagtgcaag aacttcacgc cgccgccgcg cagggcggac [illegible] 3900 

ccgggccggc cggccacgcg gccgtcaggg aggatgagga [illegible] ctgggccgcg 3960 

gtgaggagga ctccctggag gcggagaagt tcctctcaca caaattcacc aaagatcctg 4020 

gccgctcgcc ggggaggccg gcccactggg cctcaggccc caaagtggac aaccgcgcgg 4080 

tcaggagcat caatgaggcc cgctacgccg gcaaggagta ggggcggctg [illegible] 4140 

cgggacccag ggccctcggt gggagccatg ccgtctgccg [illegible] [illegible] 4200 

gtgcatagtc tctttatttt [illegible] [illegible] [illegible] aatgtttatt 4260 

ttctacgttt ctttaacctt gtltaaatta ttcagtaact [illegible] aaacaatgga 4320 

gtattctcgg [illegible] tttttgtaaa gtttccgtgc gtggcactcg ctgtatgaaa 4380 

ggagagagca aagggtgtct gcgtcgtcac caaatcgtag cgtttgttac cagaggttgt 4440 

gcactgttta cagaatcttc cttttattcc tcactcgggt ttctctgtgg [illegible] 4500 

aagtgccggt gagacccatg gctgtgttgg tgtggcccat ggctgttggt gggacccgtg 4560 

gctgatggtg tggcctgtgg ctgtcggtgg gactcgtggc tgtcaatggg acctgtggct 4620 

gtcggtggga cctacggtgg tcggtgggac cctggttatt gatgtggccc tggctgccgg 4680 

cacggcccgt ggctgttgac gcacctgtgg ttgttagtgg ggcctgaggt catcggcgtg 4740 

gcccaaggcc ggcaggtcaa cctcgcgctt gctggccagt ccaccctgcc tgccgtctgt 4800 

gcttcctcct gcccagaacg cccgctccag cgatctctcc actgtgcttt cagaagtgcc 4860 

cttcctgctg cgcagttctc ccatcctggg acggcggcag tattgaagct cgtgacaagt 4920 

gccttcacac agacccctcg caactgtcca cgcgtgccgt ggcaccaggc gctgcccacc 4980 

tgccggcccc ggccgcccct cctcgtgaaa gtgcattttt gtaaatgtgt acatattaaa 5040 

ggaagcactc tgtatatttg attgaataat gccacca 5077 

<210> 35 

<211> 1238 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Jagged2, transcript variant 1 



Met Arg Ala Gln Gly Arg Gly Arg Leu Pro Arg Arg Leu Leu Leu Leu 
1 5 10 15 

Leu Ala Leu Trp Val Gln Ala Ala Arg Pro Met Gly Tyr Phe Glu Leu 
20 25 30 

Gln Leu Ser Ala Leu Arg Asn Val Asn Gly Glu Leu Leu Ser Gly Ala 
35 40 45 

Cys Cys Asp Gly Asp Gly Arg Thr Thr Arg Ala Gly Gly Cys Gly His 
50 55 60 

Asp Glu Cys Asp Thr Tyr Val Arg Val Cys Leu Lys Glu Tyr Gln Ala 
65 70 75 80 

Lys Val Thr Pro Thr Gly Pro Cys Ser Tyr Gly His Gly Ala Thr Pro 
85 90 95 

Val Leu Gly Gly Asn Ser Phe Tyr Leu Pro Pro Ala Gly Ala Ala Gly 
100 105 110 

Asp Arg Ala Arg Ala Arg Ala Arg Ala Gly Gly Asp Gln Asp Pro Gly 
115 120 125 

Leu Val Val Ile Pro Phe Gln Phe Ala Trp Pro Arg Ser Phe Thr Leu 
130 135 140 

Ile Val Glu Ala Trp Asp Trp Asp Asn Asp Thr Thr Pro Asn Glu Glu 
145 150 155 160 

Leu Leu Ile Glu Arg Val Ser His Ala Gly Met Ile Asn Pro Glu Asp 
165 170 175 

Arg Trp Lys Ser Leu His Phe Ser Gly His Val Ala His Leu Glu Leu 
180 185 190 

Gln Ile Arg Val Arg Cys Asp Glu Asn Tyr Tyr Ser Ala Thr Cys Asn 
195 200 205 

Lys Phe Cys Arg Pro Arg Asn Asp Phe Phe Gly His Tyr Thr Cys Asp 
210 215 220 

Gln Tyr Gly Asn Lys Ala Cys Met Asp Gly Trp Met Gly Lys Glu Cys 
225 230 235 240 

Lys Glu Ala Val Cys Lys Gln Gly Cys Asn Leu Leu His Gly Gly Cys 
245 250 255 

Thr Val Pro Gly Glu Cys Arg Cys Ser Tyr Gly Trp Gln Gly Arg Phe 
260 265 270 

Cys Asp Glu Cys Val Pro Tyr Pro Gly Cys Val His Gly Ser Cys Val 
275 280 285 

Glu Pro Trp Gln Cys Asn Cys Glu Thr Asn Trp Gly Gly Leu Leu Cys 
290 295 300 

Asp Lys Asp Leu Asn Tyr Cys Gly Ser His His Pro Cys Thr Asn Gly 
305 310 315 320 

Gly Thr Cys Ile Asn Ala Glu Pro Asp Gln Tyr Arg Cys Thr Cys Pro 
325 330 335 

Asp Gly Tyr Ser Gly Arg Asn Cys Glu Lys Ala Glu His Ala Cys Thr 
340 345 350 

Ser Asn Pro Cys Ala Asn Gly Gly Ser Cys His Glu Val Pro Ser Gly 
355 360 365 

Phe Glu Cys His Cys Pro Ser Gly Trp Ser Gly Pro Thr Cys Ala Leu 
370 375 380 

Asp Ile Asp Glu Cys Ala Ser Asn Pro Cys Ala Ala Gly Gly Thr Cys 
385 390 395 400 

Val Asp Gln Val Asp Gly Phe Glu Cys Ile Cys Pro Glu Gln Trp Val 
405 410 415 

Gly Ala Thr Cys Gln Leu Asp Ala Asn Glu Cys Glu Gly Lys Pro Cys 
420 425 430 

Leu Asn Ala Phe Ser Cys Lys Asn Leu Ile Gly Gly Tyr Tyr Cys Asp 
435 440 445 

Cys Ile Pro Gly Trp Lys Gly Ile Asn Cys His Ile Asn Val Asn Asp 
450 455 460 

Cys Arg Gly Gln Cys Gln His Gly Gly Thr Cys Lys Asp Leu Val Asn 
465 470 475 480 

Gly Tyr Gln Cys Val Cys Pro Arg Gly Phe Gly Gly Arg His Cys Glu 
485 490 495 

Leu Glu Arg Asp Glu Cys Ala Ser Ser Pro Cys His Ser Gly Gly Leu 
500 505 510 

Cys Glu Asp Leu Ala Asp Gly Phe His Cys His Cys Pro Gln Gly Phe 
515 520 525 

Ser Gly Pro Leu Cys Glu Val Asp Val Asp Leu Cys Glu Pro Ser Pro 
530 535 540 

Cys Arg Asn Gly Ala Arg Cys Tyr Asn Leu Glu Gly Asp Tyr Tyr Cys 
545 550 555 560 

Ala Cys Pro Asp Asp Phe Gly Gly Lys Asn Cys Ser Val Pro Arg Glu 
565 570 575 

Pro Cys Pro Gly Gly Ala Cys Arg Val Ile Asp Gly Cys Gly Ser Asp 
580 585 590 



Ala Gly Pro Gly Met Pro Gly Thr Ala Ala Ser Gly Val Cys Gly Pro 
595 600 605 

His Gly Arg Cys Val Ser Gln Pro Gly Gly Asn Phe Ser Cys Ile Cys 
610 615 620 

Asp Ser Gly Phe Thr Gly Thr Tyr Cys His Glu Asn Ile Asp Asp Cys 
625 630 635 640 

Leu Gly Gln Pro Cys Arg Asn Gly Gly Thr Cys Ile Asp Glu Val Asp 
645 650 655 

Ala Phe Arg Cys Phe Cys Pro Ser Gly Trp Glu Gly Glu Leu Cys Asp 
660 665 670 

Thr Asn Pro Asn Asp Cys Leu Pro Asp Pro Cys His Ser Arg Gly Arg 
675 680 685 

Cys Tyr Asp Leu Val Asn Asp Phe Tyr Cys Ala Cys Asp Asp Gly Trp 
690 695 700 

Lys Gly Lys Thr Cys His Ser Arg Glu Phe Gln Cys Asp Ala Tyr Thr 
705 710 715 720 

Cys Ser Asn Gly Gly Thr Cys Tyr Asp Ser Gly Asp Thr Phe Arg Cys 
725 730 735 

Ala Cys Pro Pro Gly Trp Lys Gly Ser Thr Cys Ala Val Ala Lys Asn 
740 745 750 

Ser Ser Cys Leu Pro Asn Pro Cys Val Asn Gly Gly Thr Cys Val Gly 
755 760 765 

Ser Gly Ala Ser Phe Ser Cys Ile Cys Arg Asp Gly Trp Glu Gly Arg 
770 775 780 

Thr Cys Thr His Asn Thr Asn Asp Cys Asn Pro Leu Pro Cys Tyr Asn 
785 790 795 800 

Gly Gly Ile Cys Val Asp Gly Val Asn Trp Phe Arg Cys Glu Cys Ala 
805 810 815 

Pro Gly Phe Ala Gly Pro Asp Cys Arg Ile Asn Ile Asp Glu Cys Gln 
820 825 830 

Ser Ser Pro Cys Ala Tyr Gly Ala Thr Cys Val Asp Glu Ile Asn Gly 
835 840 845 

Tyr Arg Cys Ser Cys Pro Pro Gly Arg Ala Gly Pro Arg Cys Gln Glu 
850 855 860 

Val Ile Gly Phe Gly Arg Ser Cys Trp Ser Arg Gly Thr Pro Phe Pro 
865 870 875 880 

His Gly Ser Ser Trp Val Glu Asp Cys Asn Ser Cys Arg Cys Leu Asp 
885 890 895 

Gly Arg Arg Asp Cys Ser Lys Val Trp Cys Gly Trp Lys Pro Cys Leu 
900 905 910 

Leu Ala Gly Gln Pro Glu Ala Leu Ser Ala Gln Cys Pro Leu Gly Gln 
915 920 925 

Arg Cys Leu Glu Lys Ala Pro Gly Gln Cys Leu Arg Pro Pro Cys Glu 
930 935 940 

Ala Trp Gly Glu Cys Gly Ala Glu Glu Pro Pro Ser Thr Pro Cys Leu 
945 950 955 960 

Pro Arg Ser Gly His Leu Asp Asn Asn Cys Ala Arg Leu Thr Leu His 
965 970 975 

Phe Asn Arg Asp His Val Pro Gln Gly Thr Thr Val Gly Ala Ile Cys 
980 985 990 

Ser Gly Ile Arg Ser Leu Pro Ala Thr Arg Ala Val Ala Arg Asp Arg 
995 1000 1005 

Leu Leu Val Leu Leu Cys Asp Arg Ala Ser Ser Gly Ala Ser Ala 
1010 1015 1020 

Val Glu Val Ala Val Ser Phe Ser Pro Ala Arg Asp Leu Pro Asp 
1025 1030 1035 

Ser Ser Leu Ile Gln Gly Ala Ala His Ala Ile Val Ala Ala Ile 
1040 1045 1050 

Thr Gln Arg Gly Asn Ser Ser Leu Leu Leu Ala Val Thr Glu Val 
1055 1060 1065 

Lys Val Glu Thr Val Val Thr Gly Gly Ser Ser Thr Gly Leu Leu 
1070 1075 1080 

Val Pro Val Leu Cys Gly Ala Phe Ser Val Leu Trp Leu Ala Cys 
1085 1090 1095 

Val Val Leu Cys Val Trp Trp Thr Arg Lys Arg Arg Lys Glu Arg 
1100 1105 1110 

Glu Arg Ser Arg Leu Pro Arg Glu Glu Ser Ala Asn Asn Gln Trp 
1115 1120 1125 

Ala Pro Leu Asn Pro Ile Arg Asn Pro Ile Glu Arg Pro Gly Gly 
1130 1135 1140 

His Lys Asp Val Leu Tyr Gln Cys Lys Asn Phe Thr Pro Pro Pro 
1145 1150 1155 

Arg Arg Ala Asp Glu Ala Leu Pro Gly Pro Ala Gly His Ala Ala 
1160 1165 1170 

Val Arg Glu Asp Glu Glu Asp Glu Asp Leu Gly Arg Gly Glu Glu 
1175 1180 1185 

Asp Ser Leu Glu Ala Glu Lys Phe Leu Ser His Lys Phe Thr Lys 
1190 1195 1200 

Asp Pro Gly Arg Ser Pro Gly Arg Pro Ala His Trp Ala Ser Gly 
1205 1210 1215 

Pro Lys Val Asp Asn Arg Ala Val Arg Ser Ile Asn Glu Ala Arg 
1220 1225 1230 

Tyr Ala Gly Lys Glu 
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<210> 36 

<211> 2223 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Hey-1 

<400> 36 

tcagtgtgtg cggaacgcaa gcagccgaga gcggagaggc gccgctgtag ttaactcctc 60 

cctgcccgcc gcgccgaccc tccccaggaa cccccaggga gccagcatga agcgagctca 120 

ccccgagtac agctcctcgg acagcgagct ggacgagacc atcgaggtgg agaaggagag 180 

tgcggacgag aatggaaact tgagttcggc tctaggttcc atgtccccaa ctacatcttc 240 

ccagattttg gccagaaaaa gacggagagg aataattgag aagcgccgac gagaccggat 300 

caataacagt ttgtctgagc tgagaaggct ggtacccagt gcttttgaga agcagggatc 360 

tgctaagcta gaaaaagccg agatcctgca gatgaccgtg gatcacctga aaatgctgca 420 

tacggcagga gggaaaggtt actttgacgc gcacgccctt gctatggact atcggagttt 480 

gggatttcgg gaatgcctgg cagaagttgc gcgttatctg agcatcattg aaggactaga 540 

tgcctctgac ccgcttcgag ttcgactggt ttcgcatctc aacaactacg cttcccagcg 600 

ggaagccgcg agcggcgccc acgcgggcct cggacacatt ccctggggga ccgtcttcgg 660 

acatcacccg cacatcgcgc acccgctgtt gctgccccag aacggccacg ggaacgcggg 720 

caccacggcc tcacccacgg aaccgcacca ccagggcagg ctgggctcgg cacatccgga 780 

ggcgcctgct ttgcgagcgc cccctagcgg cagcttcgga ccggtgctcc ctgtggtcac 840 

ctccgcctcc aaactgtcgc tgcctctgct ctcctcagtg gcctccctgt cggccttccc 900 

cttctctttc ggctccttcc acttactgtc tcccaatgca ctgagccctt cagcacccac 960 

gcaggctgca aaccttggca agccctatag accttggggg acggagatcg gagcttttta 1020 

aagaactgat gtagaatgag ggaggggaaa gtttaaaatc ccagctgggc tggactgttg 1080 

ccaacatcac cttaaagtcg tcagtaaaag taaaaaggaa aaaggtacac tttcagataa 1140 

tttttttttt aaagactaaa ggtttgttgg tttactttta tcttttttaa tgtttttttc 1200 

atcatgtcat gtattagcag tttttaaaaa ctagttgtta aattttgttc aagacattaa 1260 

attgaaatag tgagtataag ccaacacttt gtgataggtt tgtactgtgc ctaatttact 1320 

ttgtaaacca gaatgattcc gtttttgcct caaaatttgg ggaatcttaa catttaggta 1380 

tttttggtct gtttttctcc ttgtatagtt atggtctgtt tttagaatta attttccaaa 1440 

ccactatgct taatgttaac atgattctgt ttgttaatat tttgacagat taaggtgttg 1500 

tataaataat attcttttgg ggggagggga actatattga attttatatt tctgagcaaa 1560 

gcgttgacaa atcagatgat cagctttatc caagaaagaa gactagtaaa ttgtctgcct 1620 
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cctatagcag aaaggtgaat gtacaaactg ttggtggcct gaatccatct gaccagctgc 1680 

tggtatctgc caggactggc agttctgatt tagttaggag gaccgctgat aggttaggtc 1740 

tcatttggag tgttggtgga aaggaaactg aaggtaattg aatagaatac gcctgcattt 1800 

accagcccca gcaacacaaa gaatttttaa tcacacggat ctcaaattca caaatgttaa 1860 

catggataag tgatcatggt gtgcgagtgg tcaattgagt agtacagtgg aaactgttaa 1920 

atgcataacc taattttcct gggactgcca tattttcttt taactggaaa tttttatgtg 1980 

agttttcctt ttggtgcatg gaactgtggt tgccaaggta tttaaaaggg ctttcctgcc 2040 

tccttctctt tgatttattt aatttgattt gggctataaa atatcatttt tcaggtttat 2100 

tcttttagca ggtgtagtta aacgacctcc actgaactgg gtttgacctc tgttgtactg 2160 

atgtgttgtg actaaataaa aaagaaagaa caaagtaaaa aaaaaaaaaa aaaaaaaaaa 2220 

aaa 2223 

<210> 37 

<211> 304 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Hey-1 



Met Lys Arg Ala His Pro Glu Tyr Ser Ser Ser Asp Ser Glu Leu Asp 
1 5 10 15 

Glu Thr Ile Glu Val Glu Lys Glu Ser Ala Asp Glu Asn Gly Asn Leu 
20 25 30 

Ser Ser Ala Leu Gly Ser Met Ser Pro Thr Thr Ser Ser Gln Ile Leu 
35 40 45 

Ala Arg Lys Arg Arg Arg Gly Ile Ile Glu Lys Arg Arg Arg Asp Arg 
50 55 60 

Ile Asn Asn Ser Leu Ser Glu Leu Arg Arg Leu Val Pro Ser Ala Phe 
65 70 75 80 

Glu Lys Gln Gly Ser Ala Lys Leu Glu Lys Ala Glu Ile Leu Gln Met 
85 90 95 

Thr Val Asp His Leu Lys Met Leu His Thr Ala Gly Gly Lys Gly Tyr 
100 105 110 

Phe Asp Ala His Ala Leu Ala Met Asp Tyr Arg Ser Leu Gly Phe Arg 
115 120 125 

Glu Cys Leu Ala Glu Val Ala Arg Tyr Leu Ser Ile Ile Glu Gly Leu 
130 135 140 

Asp Ala Ser Asp Pro Leu Arg Val Arg Leu Val Ser His Leu Asn Asn 
145 150 155 160 

Tyr Ala Ser Gln Arg Glu Ala Ala Ser Gly Ala His Ala Gly Leu Gly 
165 170 175 

His Ile Pro Trp Gly Thr Val Phe Gly His His Pro His Ile Ala His 
180 185 190 

Pro Leu Leu Leu Pro Gln Asn Gly His Gly Asn Ala Gly Thr Thr Ala 
195 200 205 

Ser Pro Thr Glu Pro His His Gln Gly Arg Leu Gly Ser Ala His Pro 
210 215 220 

Glu Ala Pro Ala Leu Arg Ala Pro Pro Ser Gly Ser Phe Gly Pro Val 
225 230 235 240 

Leu Pro Val Val Thr Ser Ala Ser Lys Leu Ser Leu Pro Leu Leu Ser 
245 250 255 

Ser Val Ala Ser Leu Ser Ala Phe Pro Phe Ser Phe Gly Ser Phe His 
260 265 270 

Leu Leu Ser Pro Asn Ala Leu Ser Pro Ser Ala Pro Thr Gln Ala Ala 
275 280 285 

Asn Leu Gly Lys Pro Tyr Arg Pro Trp Gly Thr Glu Ile Gly Ala Phe 
290 295 300 



<210> 38 

<211> 2533 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Hey-2 



<400> 38 
tcggcgtccg agcttccggc cgggctgtgc cccgcgcggt cttcgccggg atgaagcgcc 60 

cctgcgagga gacgacctcc gagagcgaca tggacgagac catcgacgtg gggagcgaga 120 

acaattactc ggggcaaagt actagctctg tgattagatt gaattctcca acaacaacat 180 

ctcagattat ggcaagaaag aaaaggagag ggattataga gaaaaggcgt cgggatcgga 240 

taaataacag tttatctgag ttgagaagac ttgtgccaac tgcttttgaa aaacaaggat 300 

ctgcaaagtt agaaaaagct gaaatattgc aaatgacagt ggatcatttg aagatgcttc 360 

aggcaacagg gggtaaaggc tactttgacg cacacgctct tgccatggac ttcatgagca 420 

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taggattccg agagtgccta acagaagttg cgcggtacct gagctccgtg gaaggcctgg 480 

actcctcgga tccgctgcgg gtgcggcttg tgtctcatct cagcacttgc gccacccagc 540 

gggaggcggc ggccatgaca tcctccatgg cccaccacca tcatccgctc cacccgcatc 600 

actgggccgc cgccttccac cacctgcccg cagccctgct ccagcccaac ggcctccatg 660 

cctcagagtc aaccccttgt cgcctctcca caacttcaga agtgcctcct gcccacggct 720 

ctgctctcct cacggccacg tttgcccatg cggattcagc cctccgaatg ccatccacgg 780 

gcagcgtcgc cccctgcgtg ccacctctct ccacctctct cttgtccctc tctgccaccg 840 

tccacgccgc agccgcagca gccaccgcgg ctgcacacag cttccctctg tccttcgcgg 900 

gggcattccc catgcttccc ccaaacgcag cagcagcagt ggccgcggcc acagccatca 960 

gcccgccctt gtcagtatca gccacgtcca gtcctcagca gaccagcagt ggaacaaaca 1020 

ataaacctta ccgaccctgg gggacagaag ttggagcttt ttaaattttt cttgaacttc 1080 

ttgcaatagt aactgaatgt cctccatttc agagtcagct taaaacctct gcaccctgaa 1140 

ggtagccata cagatgccga cagatccaca aaggaacaat aaagctattt gagacacaaa 1200 

cctcacgagt ggaaatgtgg tattctcttt tttttctctc ccttttttgt ttggttcaag 1260 

gcagctcggt aactgacatc agcaactttt gaaaacttca cacttgttac catttagaag 1320 

tttcctggaa aatatatgga ccgtaccatc cagcagtgca tcagtatgtc tgaattgggg 1380 

aagtaaaatg ccctgactga attctcttga gactagatgg gacatacata tatagagaga 1440 

gagtgagaga gtcgtgtttc gtaagtgcct gagcttagga agttttcttc tggatatata 1500 

acattgcaca agggaagacg agtgtggagg ataggttaag aaaggaaagg gacagaagtc 1560 

ttgcaatagg ctgcagacat tttaatacca tgccagagaa gagtattctg ctgaaaccaa 1620 

caggttttac tggtcaaaat gactgctgaa aataattttc aagttgaaag atctagtttt 1680 

atcttagttt gccttctttg tacagacatg ccaagaggtg acatttagca gtgcattggt 1740 

ataagcaatt atttcatcag ttctcagatt aacaagcatt tctgctctgc ctgcaggccc 1800 

ccaggcactt ttttttttgg atggctcaaa atatggtgct gctttatata aaccttacat 1860 

ttatatagtg cacctatgag cagttgccta ccatgtgtcc accagaggct atttaattca 1920 

tgccaacttg aaaactctcc agtttgtagg agtttggttt aatttattca gtttcattag 1980 

gactattttt atatatttat cctcttcatt ttctcctaat gatgcaacat ctattcttgt 2040 

caccctttgg gagaagttac atttctggag gtgatgaagc aaggagggag cactaggaag 2100 

agaaaagcta caatttttaa agctctttgt caagttagtg attgcatttg atcccaaaac 2160 

aagatgaatg tatgcaatgg gatgtacata agttattttt gcccatgcct aaactagtgc 2220 

tatgtaatgg ggttgtggtt ttgttttttt cgatttcgtt taatgacaaa ataatctctt 2280 

aatatgctga aatcaagcac gtgagagttt ttgtttaaaa gataagagac acagcatgta 2340 

ttatgcactt catttctcta ctgtgtggag aaagcaataa acattatgag aatgttaaac 2400 

gttatgcaaa attatacttt taaatatttg ttttgaaatt actgtaccta gtcttttttg 2460 

cattactttg taaccttttt ctatgcaaga gtctttacat accactaatt aaatgaagtc 2520 

ctttttgact att 2533 

<210> 39 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Hey-2 

<400> 39 

Met Lys Arg Pro Cys Glu Glu Thr Thr Ser Glu Ser Asp Met Asp Glu 
1 5 10 15 

Thr Ile Asp Val Gly Ser Glu Asn Asn Tyr Ser Gly Gln Ser Thr Ser 
20 25 30 

Ser Val Ile Arg Leu Asn Ser Pro Thr Thr Thr Ser Gln Ile Met Ala 
35 40 45 

Arg Lys Lys Arg Arg Gly Ile Ile Glu Lys Arg Arg Arg Asp Arg Ile 
50 55 60 

Asn Asn Ser Leu Ser Glu Leu Arg Arg Leu Val Pro Thr Ala Phe Glu 
65 70 75 80 

Lys Gln Gly Ser Ala Lys Leu Glu Lys Ala Glu Ile Leu Gln Met Thr 
85 90 95 

Val Asp His Leu Lys Met Leu Gln Ala Thr Gly Gly Lys Gly Tyr Phe 
100 105 110 

Asp Ala His Ala Leu Ala Met Asp Phe Met Ser Ile Gly Phe Arg Glu 
115 120 125 

Cys Leu Thr Glu Val Ala Arg Tyr Leu Ser Ser Val Glu Gly Leu Asp 
130 135 140 

Ser Ser Asp Pro Leu Arg Val Arg Leu Val Ser His Leu Ser Thr Cys 
145 150 155 160 

Ala Thr Gln Arg Glu Ala Ala Ala Met Thr Ser Ser Met Ala His His 
165 170 175 

His His Pro Leu His Pro His His Trp Ala Ala Ala Phe His His Leu 
180 185 190 

Pro Ala Ala Leu Leu Gln Pro Asn Gly Leu His Ala Ser Glu Ser Thr 
195 200 205 

Pro Cys Arg Leu Ser Thr Thr Ser Glu Val Pro Pro Ala His Gly Ser 
210 215 220 

Ala Leu Leu Thr Ala Thr Phe Ala His Ala Asp Ser Ala Leu Arg Met 
225 230 235 240 

Pro Ser Thr Gly Ser Val Ala Pro Cys Val Pro Pro Leu Ser Thr Ser 
245 250 255 

Leu Leu Ser Leu Ser Ala Thr Val His Ala Ala Ala Ala Ala Ala Thr 
260 265 270 

Ala Ala Ala His Ser Phe Pro Leu Ser Phe Ala Gly Ala Phe Pro Met 
275 280 285 

Leu Pro Pro Asn Ala Ala Ala Ala Val Ala Ala Ala Thr Ala Ile Ser 
290 295 300 

Pro Pro Leu Ser Val Ser Ala Thr Ser Ser Pro Gln Gln Thr Ser Ser 
305 310 315 320 

Gly Thr Asn Asn Lys Pro Tyr Arg Pro Trp Gly Thr Glu Val Gly Ala 
325 330 335 



<210> 40 

<211> 1471 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Hes-1 

<400> 40 

atcacacagg atccggagct ggtgctgata acagcggaat cccccgtcta cctctctcct 60 

tggtcctgga acagcgctac tgatcaccaa gtagccacaa aatataataa accctcagca 120 

cttgctcagt agttttgtga aagtctcaag taaaagagac acaaacaaaa aattcttttt 180 

cgtgaagaac tccaaaaata aaattctcta gagataaaaa aaaaaaaaaa aggaaaatgc 240 

cagctgatat aatggagaaa aattcctcgt ccccggtggc tgctacccca gccagtgtca 300 

acacgacacc ggataaacca aagacagcat ctgagcacag aaagtcatca aagcctatta 360 

tggagaaaag acgaagagca agaataaatg aaagtctgag ccagctgaaa acactgattt 420 

tggatgctct gaagaaagat agctcgcggc attccaagct ggagaaggcg gacattctgg 480 

aaatgacagt gaagcacctc cggaacctgc agcgggcgca gatgacggct gcgctgagca 540 

cagacccaag tgtgctgggg aagtaccgag ccggcttcag cgagtgcatg aacgaggtga 600 
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cccgcttcct gtccacgtgc gagggcgtta ataccgaggt gcgcactcgg ctgctcggcc 660 

acctggccaa ctgcatgacc cagatcaatg ccatgaccta ccccgggcag ccgcaccccg 720 

ccttgcaggc gccgccaccg cccccaccgg gacccggcgg cccccagcac gcgccgttcg 780 

cgccgccgcc gccactcgtg cccatccccg ggggcgcggc gccccctccc ggcggcgccc 840 

cctgcaagct gggcagccag gctggagagg cggctaaggt gtttggaggc ttccaggtgg 900 

taccggctcc cgatggccag tttgctttcc tcattcccaa cggggccttc gcgcacagcg 960 

gccctgtcat ccccgtctac accagcaaca gcggcacctc cgtgggcccc aacgcagtgt 1020 

caccttccag cggcccctcg cttacggcgg actccatgtg gaggccgtgg cggaactgag 1080 

ggggctcagg ccacccctcc tcctaaactc cccaacccac ctctcttccc tccggactct 1140 

aaacaggaac ttgaatactg ggagagaaga ggactttttt gattaagtgg ttactttgtg 1200 

tttttttaat ttctaagaag ttactttttg tagagagagc tgtattaagt gactgaccat 1260 

gcactatatt tgtatatatt ttatatgttc atattggatt gcgcctttgt attataaaag 1320 

ctcagatgac atttcgtttt ttacacgaga tttctttttt atgtgatgcc aaagatgttt 1380 

gaaaatgctc ttaaaatatc ttcctttggg gaagtttatt tgagaaaata taataaaaga 1440 

aaaaagtaaa ggcaaaaaaa aaaaaaaaaa a 1471 

<210> 41 
<211> 280 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Hes-1 



Met Pro Ala Asp Ile Met Glu Lys Asn Ser Ser Ser Pro Val Ala Ala 
1 5 10 15 

Thr Pro Ala Ser Val Asn Thr Thr Pro Asp Lys Pro Lys Thr Ala Ser 
20 25 30 

Glu His Arg Lys Ser Ser Lys Pro Ile Met Glu Lys Arg Arg Arg Ala 
35 40 45 

Arg Ile Asn Glu Ser Leu Ser Gln Leu Lys Thr Leu Ile Leu Asp Ala 
50 55 60 

Leu Lys Lys Asp Ser Ser Arg His Ser Lys Leu Glu Lys Ala Asp Ile 
65 70 75 80 

Leu Glu Met Thr Val Lys His Leu Arg Asn Leu Gln Arg Ala Gln Met 
85 90 95 

Thr Ala Ala Leu Ser Thr Asp Pro Ser Val Leu Gly Lys Tyr Arg Ala 
100 105 110 

Gly Phe Ser Glu Cys Met Asn Glu Val Thr Arg Phe Leu Ser Thr Cys 
115 120 125 

Glu Gly Val Asn Thr Glu Val Arg Thr Arg Leu Leu Gly His Leu Ala 
130 135 140 

Asn Cys Met Thr Gln Ile Asn Ala Met Thr Tyr Pro Gly Gln Pro His 
145 150 155 160 

Pro Ala Leu Gln Ala Pro Pro Pro Pro Pro Pro Gly Pro Gly Gly Pro 
165 170 175 

Gln His Ala Pro Phe Ala Pro Pro Pro Pro Leu Val Pro Ile Pro Gly 
180 185 190 

Gly Ala Ala Pro Pro Pro Gly Gly Ala Pro Cys Lys Leu Gly Ser Gln 
195 200 205 

Ala Gly Glu Ala Ala Lys Val Phe Gly Gly Phe Gln Val Val Pro Ala 
210 215 220 

Pro Asp Gly Gln Phe Ala Phe Leu Ile Pro Asn Gly Ala Phe Ala His 
225 230 235 240 

Ser Gly Pro Val Ile Pro Val Tyr Thr Ser Asn Ser Gly Thr Ser Val 
245 250 255 

Gly Pro Asn Ala Val Ser Pro Ser Ser Gly Pro Ser Leu Thr Ala Asp 
260 265 270 



<210> 42 

<211> 10386 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> APC 

<220> 

<221> misc_feature 

<222> (9521)..(9521) 

<223> n is a, c, g, or t 

<400> 42 

attgaggact cggaaatgag gtccaagggt agccaaggat ggctgcagct tcatatgatc 60 

agttgttaaa gcaagttgag gcactgaaga tggagaactc aaatcttcga caagagctag 120 

aagataattc caatcatctt acaaaactgg aaactgaggc atctaatatg aaggaagtac 180 
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ttaaacaact 


acaaggaagt attgaagatg 


aagctatggc 


ttcttctgga 


cagattgatt 


240 


"ta.tt3.ga.gcg 


tcttaaagag cttaacttag 


atagcagtaa 


tttccctgga 


gtaaaactgc 


300 


ggtcaaaaat 




geegggaagg 


atctgtatca 


agccgttctg 


360 


gagagtgcag 


tcctgttcct atgggttcat 


ttccaagaag 


agggtttgta 


aatggaagca 


420 


gagaaagtac 


tggatattta gaagaacttg 


agaaagagag 


gtcattgett 


ettgetgate 


480 


ttgacaaaga 


agaaaaggaa aaagactggt 


attaegctea 


acttcagaat 


ctcactaaaa 


540 


gaatagatag 


tcttccttta actgaaaatt 


tttccttaca 


aacagatatg 


accagaaggc 


600 


aattggaata 


tgaagcaagg caaat cagag 


ttgcgatgga 


agaacaacta 


ggtacctgcc 


660 


aggatatgga 


aaaacgagca cagcgaagaa 


tagecagaat 


tcagcaaatc 


gaaaaggaca 


720 


tacttcgtat 


acgacagctt ttacagt ccc 


aag caacaga 


agcagagagg 


tcatctcaga 


780 


acaagcatga 


aaccggct ca catgatgctg 




tgaaggtcaa 


yy a y L yyy Q y 


840 


aaatcaacat 




agggttcaac 


tacacgaatg 


gaccatgaaa 


900 


cagccagtgt 


tttgagttct agtagcacac 


actctgcacc 


tcgaaggctg 


acaagtcatc 


960 


tgggaaccaa 


ggtiggaaatg g tg tat teat 


tgttgtcaat 


gcttggtact 


catgataagg 


1020 


atgatatgtc 


gcgaactttg ctagctatgt 


ctagctccca 


agacagctgt 


atatccatgc 


1080 


gacagtctgg 


atgtcttcct ctcctcatcc 


agcttttaca 


tggcaatgac 


aaagactctg 


1140 


tattgttggg 


aaattccegg ggcagtaaag 


aggcteggge 


cagggecagt 


gcagcactcc 


1200 


acaacatcat 


tcactcacag cctgatgaca 


agagaggcag 


gcgtgaaatc 


cgagtccttc 


1260 


atcttttgga 


acagataege gc ttac t gtg 


aaacctgttg 


aaaataacaa 


gaagctcatg 


1320 


aaccaggcat 


ggaccaggac aaaaatccaa 


tgccagctcc 


tgttgaacat 


cagatctgtc 


1380 


ctgctgtgtg 


tgttctaatg aaactttcat 


ttgatgaaga 


gcatagacat 


gcaatgaatg 


1440 


aactaggggg 


actacaggcc attgeagaat 


tattg caagt 


ggactgtgaa 


atgtacgggc 


1500 


ttactaatga 


ccactacagt attacactaa 


gaegatatge 


tggaatggct 


ttgacaaact 


1560 


tgacttttgg 


agatgtagee aacaaggcta 


cgctatgctc 


tatgaaaggc 


tgcatgagag 


1620 


cacttgtggc 


ccaactaaaa tctgaaagtg 


aagacttaca 


gcaggttatt 


gcaagtgttt 


1680 


tgaggaattt 


gtcttggcga gcagatgtaa 


atagtaaaaa 


gaegttgega 


gaagttggaa 


1740 


gtgtgaaagc 


at-i-ria-i-nna a 1-n1-nr1~1~1~;in 
a l Lyd Lyydd. Ly iyt- l l Lay 


aagttaaaaa 


ggaatcaacc 


etcaaaageg 


1800 


tattgagtgc 


cttatggaat ttgtcagcac 


attgeactga 


gaataaagct 


gatatatgtg 


1860 


ctgtagatgg 


tgeacttgea tttttggttg 


gcactcttac 


ttaceggage 


cagacaaaca 


1920 


ctttagccat 


tattgaaagt ggaggtggga 


tattaeggaa 


tgtgtccagc 


ttgatagcta 


1980 


caaatgagga 


ccacaggcaa atcctaagag 


agaacaactg 


tctacaaact 


ttattacaac 


2040 


acttaaaatc 


tcatagtttg acaatagtca 


gtaatgcatg 


tggaactttg 


tggaatctct 


2100 


cagcaagaaa 


tcctaaagac caggaagcat 


tatgggacat 


gggggcagtt 


ageatgetea 


2160 


agaacctcat 


tcattcaaag cacaaaatga 


ttgctatggg 


aagtgctgca 


gctttaagga 


2220 
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atctcatggc 


aaataggcct 


gcgaagtaca aggatgccaa "tattatgtct 


cctggctcaa 




gcttgccatc 


tcttcatgtt 


aggaaacaaa aagccctaga agcagaatta 


gatgctcagc 




acttatcaga 


aacttttgac 




c 3"t c g 1 3.9 1 3. 




agcagagaca 


caagcaaag*t 


c"tcta"tggtig a"t"ta"tg"t"t"t"t "tgacaccaat 


C Q3l C 3. "t Q 3 t Q 




ataataggtc 


agacaatttt 




"t a."t 1 1 9 3- 3. 1 3 




ctacagtgtt 


acccagctcc 


tcttcatcaa gaggaagctt agatagttct 


cgttctgaaa 


2580 


aagatiagaag 


t"t"tggagaga 




ccagcaacag 




aaaatccagg 


aacttcttca 


aagcgaggti~t "tgcagatictc caccactgca 


gcccagattg 


2700 


ccaaagtcat" 


ggaagaag~tg 




39 tit C"t999^- 


2760 


ctaccactga 


attacattgt 


gtgacagatg agagaaa~tgc acttiagaaga 




2820 


cccatacaca 


ttcaaacact 


tacaatttca ctaagtcgga aaattcaaat 


agg!«tgtt 


2880 


ctatgcctta 


tgccaaatta 


gaatacaaga gatcttcaaa tgatagttta 


aatagtgtca 


2940 


gtagtagtga 


tggttatggt 


aaaagaggtic aaatgaaacc ctcgattgaa 


tcctattctg 


3000 


aagatgatga 


aagtaagttt 


tgcagttatg gtcaataccc agccgaccta 


gcccataaaa 


3060 


~taca~tag~tgc 


aaatcatatg 


ga~tgataa~tg a~tggagaac~t aga~tacacca 


ataaattata 


3120 


gtcttaaata 


ttcagatgag 


cagttgaact ctggaaggca aagtccttca 


cagaatgaaa 


3180 


ga*tgggcaag 


acccaaacac 


ataatagaag a~tgaaa~taaa acaaagtxjag 


caaagacaat 


3240 


caaggaatca 


aagtacaact 


tatcctgttt atactgagag cactgatgat 


aaacacctca 


3300 


agttccaacc 


acattttgga 


cagcaggaat gtgtttctcc atacaggtca 


caaaaaacca 


3360 


atggttcaga 


aacaaatxga 


gtgggttcta atcatggaat "taatcaaaat 


g~taagccag~t 


3420 


ctttgtgtca 


agaagatgac 


tatgaagatg ataagcctac caattatagt 


gaacgttact 


3480 


ctgaagaaga 


acagcatgaa 


gaagaagaga gaccaacaaa ttatagcata 


aaatataatg 


3540 


aagagaaacg 


tcatgtggat 


cagcctattg attatagttt aaaatatgcc 


acagatattx 


3600 


cttxatcaca 


gaaacagtca 


ttttcattct caaagagttc atctggacaa 


agcagtaaaa 


3660 


ccgaacatat 


gtcttcaagc 


agtgagaata cgtccacacc ttcatctaat 


gccaagaggc 


3720 


agaatcagct 


ccatccaagt 


tctgcacaga gtagaagtgg tcagcctcaa 


aaggctgcca 


3780 


cttgcaaagt 


ttcttctatt 


aaccaagaaa caatacagac ttattgtgta 


gaagatactc 


3840 


caatatgttt 


ttcaagatgt 


agttcattat catctttgtc atcagctgaa 


gatgaaatag 


3900 


gatgtaatca 


gacgacacag 


gaagcagatt ctgctaatac cctgcaaata 


gcagaaataa 


3960 


aagaaaagat 






gcagtgtcac 


4020 


agcaccctag 


aaccaaatcc 


agcagactgc agggttctag tttatcttca 


gaatcagcca 


4080 


ggcacaaagc 


tgttgaattt 


tcttcaggag cgaaatctcc ctccaaaagt 


ggtgctcaga 


4140 


cacccaaaag 


tccacctgaa 


cactatgttc aggagacccc actcatgttt 


agcagatgta 


4200 


cttctgtcag 


ttcacttgat 


agttttgaga gtcgttcgat tgccagctcc 


gttcagagtg 


4260 
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a.a.cca."tgca.g 


tggaatggta 


agtggcatta taagccccag 


tgatcttcca 


gatagccctg 


4320 


9 a. c 3.3. 3. c c a.t 


gccaccaagc 


agaagtaaaa cacctccacc 


acctcctcaa 


acagctcaaa 


4380 


CC339CQ393 


agtacctaaa 


aa*taaagcac ctiactgctga 


aaagagagag 


agtggac eta 


4440 


3gC339C"t9C 


agtaaatgct 


gcagtitcaga ggg~tccagg~t 


~tc"t"tccagat 


gctgatactt 


4500 


"t 3t "t 3 C 3"t "t ~t 




agtactccag atggattttc 


ttgttcatcc 


agectgagtg 


4560 


CtC"t93QCC"t 


cgatgagcca 


tttatacaga aagatgtgga 


attaagaata 


atgcctccag 


4620 




tgacaatggg 


aatgaaacag aatcagagca 


gcctaaagaa 


tcaaatgaaa 


4680 


3CC3393933 


agaggcagaa 


aaaactattg attctgaaaa 


ggacctatta 


gatgattcag 


4740 


3tg3"t93"t93 


tattgaaata 


ctagaagaat gtattatttc 


tgccatgcca 


acaaagtcat 


4800 




aaaaaagcca 


gcccagactg cttcaaaatt 


acctccacct 


ataqcaaqaa 


4860 


33CC33gtlC3 


gctgcctgtg 


tacaaacttc taccatcaca 


aaacaggttg 


caaccccaaa 


4920 




ttttacaccg 


ggggatgata tgccacgggt 


gtattgtgtt 


gaagggacac 


4980 


Ct3"t333C"t"t 


ttccacagct 


acatctctaa gtgatctaac 


aatcgaatcc 


cctccaaatg 


5040 




tggagaagga 


gttagaggag gagcacagtc 


aggtgaattt 


gaaaaacgag 


5100 


3t3CC3ttICC 


tacagaaggc 


agaagtacag atgaggctca 


aggaggaaaa 


acctcatctg 


5160 




tgaattggat 


gacaataaag cagaggaagg 


tgatattctt 


geagaatgea 


5220 


"t 1 3 3"t "t C "t 9 C 




gggaaaagtc acaagccttt 


ccgtgtgaaa 


aagataatgg 


5280 


accaggtcca 


gcaagcatct 


gcgtcgtctt ctgcacccaa 


caaaaatcag 


ttagatggta 


5340 


agaaaaagaa 


accaac~t~tca 


ccagtaaaac ctataccaca 


aaatactgaa 


tataggacac 


5400 


gtgtaagaaa 


aaatigcagac 


"tcaaaaaata atttaaatgc 


tgagagagtt 


ttctcagaca 


5460 




aaagaaacag 


aatttgaaaa ataattccaa 


ggacttcaat 


gataagctcc 


5520 


C333"t33tg3 


agatagagtc 


agaggaagtt ttgcttttga 


ttcacctcat 


cattacacgc 


5580 


c"t3t"tg339Q 


aactccttac 


tgtttttcac gaaatgattc 


tttgagttct 


ctagattttg 


5640 


3"t 9 3 "t 9 3 1 9 


tgttgacctt 


tccagggaaa aggctgaatt 


aagaaaggca 


aaagaaaata 


5700 


3gg33"tc3.g3 


ggc*taaag*t"t 


accagccaca cagaactaac 


ctccaaccaa 


caatcagcta 


5760 


ataagacaca 


agctattgca 


aagcagccaa taaatcgagg 


tcagcctaaa 


cccatacttc 


5820 


agaaacaa c 


cacttttccc 


cagtcatcca aagacatacc 


agacagaggg 


gcagcaactg 


5880 


atgaaaagtt 


acagaatttt 


gctattgaaa atactccagt 


ttgcttttct 


cataattcct 


5940 


ctctgagttc 


tctcagtgac 


attgaccaag aaaacaacaa 


taaagaaaat 


gaacctatca 


6000 




gccccctgac 


tcacagggag aaccaagtaa 


acctcaagca 


tcaggctatg 


6060 


ctcctlaatc 


atttcatgtt 


gaaga*taccc cag~tt~tg~t"tt 


ctcaagaaac 


agttctctca 


6120 


gttctcttag 


tattgactct 


gaagatgacc tgttgcagga 


atgtataagc 


tccgcaatgc 


6180 


caaaaaagaa 


aaagccttca 


agactcaagg gtgataatga 


aaaacatagt 


cccagaaata 


6240 


tgggtggcat 


attaggtgaa 


gatctgacac ttgatttgaa 


agatatacag 


agaccagatt 


6300 

cagaacatgg tctatcccct gattcagaaa attttgattg gaaagctatt caggaaggtg 6360
caaattccat agtaagtagt ttacatcaag ctgctgctgc tgcatgttta tctagacaag 6420
cttcgtctga ttcagattcc atcctttccc tgaaatcagg aatctctctg ggatcaccat 6480
ttcatcttac acctgatcaa gaagaaaaac cctttacaag taataaaggc ccacgaattc 6540
taaaaccagg ggagaaaagt acattggaaa ctaaaaagat agaatctgaa agtaaaggaa 6600
tcaaaggagg aaaaaaagtt tataaaagtt tgattactgg aaaagttcga tctaattcag 6660
aaatttcagg ccaaatgaaa cagccccttc aagcaaacat gccttcaatc tctcgaggca 6720
ggacaatgat tcatattcca ggagttcgaa atagctcctc aagtacaagt cctgtttcta 6780
aaaaaggccc accccttaag actccagcct ccaaaagccc tagtgaaggt caaacagcca 6840
ccacttctcc tagaggagcc aagccatctg tgaaatcaga attaagccct gttgccaggc 6900
agacatccca aataggtggg tcaagtaaag caccttctag atcaggatct agagattcga 6960
ccccttcaag acctgcccag caaccattaa gtagacctat acagtctcct ggccgaaact 7020
caatttcccc tggtagaaat ggaataagtc ctcctaacaa attatctcaa cttccaagga 7080
catcatcccc tagtactgct tcaactaagt cctcaggttc tggaaaaatg tcatatacat 7140
ctccaggtag acagatgagc caacagaacc ttaccaaaca aacaggttta tccaagaatg 7200
ccagtagtat tccaagaagt gagtctgcct ccaaaggact aaatcagatg aataatggta 7260
atggagccaa taaaaaggta gaactttcta gaatgtcttc aactaaatca agtggaagtg 7320
aatctgatag atcagaaaga cctgtattag tacgccagtc aactttcatc aaagaagctc 7380
caagcccaac cttaagaaga aaattggagg aatctgcttc atttgaatct ctttctccat 7440
catctagacc agcttctccc actaggtccc aggcacaaac tccagtttta agtccttccc 7500
ttcctgatat gtctctatcc acacattcgt ctgttcaggc tggtggatgg cgaaaactcc 7560
cacctaatct cagtcccact atagagtata atgatggaag accagcaaag cgccatgata 7620
ttgcacggtc tcattctgaa agtccttcta gacttccaat caataggtca ggaacctgga 7680
aacgtgagca cagcaaacat tcatcatccc ttcctcgagt aagcacttgg agaagaactg 7740
gaagttcatc ttcaattctt tctgcttcat cagaatccag tgaaaaagca aaaagtgagg 7800
atgaaaaaca tgtgaactct atttcaggaa ccaaacaaag taaagaaaac caagtatccg 7860
caaaaggaac atggagaaaa ataaaagaaa atgaattttc tcccacaaat agtacttctc 7920
agaccgtttc ctcaggtgct acaaatggtg ctgaatcaaa gactctaatt tatcaaatgg 7980
cacctgctgt ttctaaaaca gaggatgttt gggtgagaat tgaggactgt cccattaaca 8040
atcctagatc tggaagatct cccacaggta atactccccc ggtgattgac agtgtttcag 8100
aaaaggcaaa tccaaacatt aaagattcaa aagataatca ggcaaaacaa aatgtgggta 8160
atggcagtgt tcccatgcgt accgtgggtt tggaaaatcg cctgaactcc tttattcagg 8220
tggatgcccc tgaccaaaaa ggaactgaga taaaaccagg acaaaataat cctgtccctg 8280
tatcagagac taatgaaagt tctatagtgg aacgtacccc attcagttct agcagctcaa 8340
gcaaacacag ttcacctagt gggactgttg ctgccagagt gactcctttt aattacaacc 8400
caagccctag gaaaagcagc gcagatagca cttcagctcg gccatctcag atcccaactc 8460 
cagtgaataa caacacaaag aagcgagatt ccaaaactga cagcacagaa tccagtggaa 8520 
cccaaagtcc taagcgccat tctgggtctt accttgtgac atctgtttaa aagagaggaa 8580 
gaatgaaact aagaaaattc tatgttaatt acaactgcta tatagacatt ttgtttcaaa 8640 
tgaaacttta aaagactgaa aaattttgta aataggtttg attcttgtta gagggttttt 8700 
gttctggaag ccatatttga tagtatactt tgtcttcact ggtcttattt tgggaggcac 8760 
tcttgatggt taggaaaaaa atagtaaagc caagtatgtt tgtacagtat gttttacatg 8820 
tatttaaagt agcatcccat cccaacttcc tttaattatt gcttgtctta aaataatgaa 8880 
cactacagat agaaaatatg atatattgct gttatcaatc atttctagat tataaactga 8940 
ctaaacttac atcagggaaa aattggtatt tatgcaaaaa aaaatgtttt tgtccttgtg 9000 
agtccatcta acatcataat taatcatgtg gctgtgaaat tcacagtaat atggttcccg 9060 
atgaacaagc tttacccagc ctgtttgctt tactgcatga atgaaactga tggttcaatt 9120 
tcagaagtaa tgattaacag ttatgtggtc acatgatgtg catagagata gctacagtgt 9180 
aataatttac actattttgt gctccaaaca aaacaaaaat ctgtgtaact gtaaaacatt 9240 
gaatgaaact attttacctg aactagattt tatctgaaag taggtagaat ttttgctatg 9300 
ctgtaatttg ttgtatattc tggtatttga ggtgagatgg ctgctctttt attaatgaga 9360 
catgaattgt gtctcaacag aaactaaatg aacatttcag aataaattat tgctgtatgt 9420 

aaactgttac tgaaattggt atttgtttga agggtcttgt ttcacatttg tattaataat 9480 

tgtttaaaat gcctctttta aaagcttata taaatttttt ncttcagctt ctatgcatta 9540 

agagtaaaat tcctcttact gtaataaaaa caattgaaga agactgttgc cacttaacca 9600 

ttccatgcgt tggcacttat ctattcctga aattctttta tgtgattagc tcatcttgat 9660 

ttttaacatt tttccactta aacttttttt tcttactcca ctggagctca gtaaaagtaa 9720 

attcatgtaa tagcaatgca agcagcctag cacagactaa gcattgagca taataggccc 9780 

acataatttc ctctttctta atattataga aattctgtac ttgaaattga ttcttagaca 9840 

ttgcagtctc ttcgaggctt tacagtgtaa actgtcttgc cccttcatct tcttgttgca 9900 

actgggtctg acatgaacac tttttatcac cctgtatgtt agggcaagat ctcagcagtg 9960 

aagtataatc agcactttgc catgctcaga aaattcaaat cacatggaac tttagaggta 10020 

gatttaatac gattaagata ttcagaagta tattttagaa tccctgcctg ttaaggaaac 10080 

tttatttgtg gtaggtacag ttctggggta catgttaagt gtccccttat acagtggagg 10140 

gaagtcttcc ttcctgaagg aaaataaact gacacttatt aactaagata atttacttaa 10200 

tatatcttcc ctgatttgtt ttaaaagatc agagggtgac tgatgataca tgcatacata 10260 

tttgttgaat aaatgaaaat ttatttttag tgataagatt catacactct gtatttgggg 10320 

agagaaaacc tttttaagca tggtggggca ctcagatagg agtgaataca cctacctggt 10380

ggtcat 10386




<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> APC 

<400> 43 

Met Ala Ala Ala Ser Tyr Asp Gln Leu Leu Lys Gln Val Glu Ala Leu
1 5 10 15

Lys Met Glu Asn Ser Asn Leu Arg Gln Glu Leu Glu Asp Asn Ser Asn
20 25 30

His Leu Thr Lys Leu Glu Thr Glu Ala Ser Asn Met Lys Glu Val Leu
35 40 45

Lys Gln Leu Gln Gly Ser Ile Glu Asp Glu Ala Met Ala Ser Ser Gly
50 55 60

Gln Ile Asp Leu Leu Glu Arg Leu Lys Glu Leu Asn Leu Asp Ser Ser
65 70 75 80

Asn Phe Pro Gly Val Lys Leu Arg Ser Lys Met Ser Leu Arg Ser Tyr
85 90 95

Gly Ser Arg Glu Gly Ser Val Ser Ser Arg Ser Gly Glu Cys Ser Pro
100 105 110

Val Pro Met Gly Ser Phe Pro Arg Arg Gly Phe Val Asn Gly Ser Arg
115 120 125

Glu Ser Thr Gly Tyr Leu Glu Glu Leu Glu Lys Glu Arg Ser Leu Leu
130 135 140

Leu Ala Asp Leu Asp Lys Glu Glu Lys Glu Lys Asp Trp Tyr Tyr Ala
145 150 155 160

Gln Leu Gln Asn Leu Thr Lys Arg Ile Asp Ser Leu Pro Leu Thr Glu
165 170 175

Asn Phe Ser Leu Gln Thr Asp Met Thr Arg Arg Gln Leu Glu Tyr Glu
180 185 190

Ala Arg Gln Ile Arg Val Ala Met Glu Glu Gln Leu Gly Thr Cys Gln
195 200 205

Asp Met Glu Lys Arg Ala Gln Arg Arg Ile Ala Arg Ile Gln Gln Ile
210 215 220

Glu Lys Asp Ile Leu Arg Ile Arg Gln Leu Leu Gln Ser Gln Ala Thr
225 230 235 240

Glu Ala Glu Arg Ser Ser Gln Asn Lys His Glu Thr Gly Ser His Asp
245 250 255

Ala Glu Arg Gln Asn Glu Gly Gln Gly Val Gly Glu Ile Asn Met Ala
260 265 270

Thr Ser Gly Asn Gly Gln Gly Ser Thr Thr Arg Met Asp His Glu Thr
275 280 285

Ala Ser Val Leu Ser Ser Ser Ser Thr His Ser Ala Pro Arg Arg Leu
290 295 300

Thr Ser His Leu Gly Thr Lys Val Glu Met Val Tyr Ser Leu Leu Ser
305 310 315 320

Met Leu Gly Thr His Asp Lys Asp Asp Met Ser Arg Thr Leu Leu Ala
325 330 335

Met Ser Ser Ser Gln Asp Ser Cys Ile Ser Met Arg Gln Ser Gly Cys
340 345 350

Leu Pro Leu Leu Ile Gln Leu Leu His Gly Asn Asp Lys Asp Ser Val
355 360 365

Leu Leu Gly Asn Ser Arg Gly Ser Lys Glu Ala Arg Ala Arg Ala Ser
370 375 380

Ala Ala Leu His Asn Ile Ile His Ser Gln Pro Asp Asp Lys Arg Gly
385 390 395 400

Arg Arg Glu Ile Arg Val Leu His Leu Leu Glu Gln Ile Arg Ala Tyr
405 410 415

Cys Glu Thr Cys Trp Glu Trp Gln Glu Ala His Glu Pro Gly Met Asp
420 425 430

Gln Asp Lys Asn Pro Met Pro Ala Pro Val Glu His Gln Ile Cys Pro
435 440 445

Ala Val Cys Val Leu Met Lys Leu Ser Phe Asp Glu Glu His Arg His
450 455 460

Ala Met Asn Glu Leu Gly Gly Leu Gln Ala Ile Ala Glu Leu Leu Gln
465 470 475 480

Val Asp Cys Glu Met Tyr Gly Leu Thr Asn Asp His Tyr Ser Ile Thr
485 490 495

Leu Arg Arg Tyr Ala Gly Met Ala Leu Thr Asn Leu Thr Phe Gly Asp
500 505 510

Val Ala Asn Lys Ala Thr Leu Cys Ser Met Lys Gly Cys Met Arg Ala
515 520 525

Leu Val Ala Gln Leu Lys Ser Glu Ser Glu Asp Leu Gln Gln Val Ile
530 535 540

Ala Ser Val Leu Arg Asn Leu Ser Trp Arg Ala Asp Val Asn Ser Lys
545 550 555 560

Lys Thr Leu Arg Glu Val Gly Ser Val Lys Ala Leu Met Glu Cys Ala
565 570 575

Leu Glu Val Lys Lys Glu Ser Thr Leu Lys Ser Val Leu Ser Ala Leu
580 585 590

Trp Asn Leu Ser Ala His Cys Thr Glu Asn Lys Ala Asp Ile Cys Ala
595 600 605

Val Asp Gly Ala Leu Ala Phe Leu Val Gly Thr Leu Thr Tyr Arg Ser
610 615 620

Gln Thr Asn Thr Leu Ala Ile Ile Glu Ser Gly Gly Gly Ile Leu Arg
625 630 635 640

Asn Val Ser Ser Leu Ile Ala Thr Asn Glu Asp His Arg Gln Ile Leu
645 650 655

Arg Glu Asn Asn Cys Leu Gln Thr Leu Leu Gln His Leu Lys Ser His
660 665 670

Ser Leu Thr Ile Val Ser Asn Ala Cys Gly Thr Leu Trp Asn Leu Ser
675 680 685

Ala Arg Asn Pro Lys Asp Gln Glu Ala Leu Trp Asp Met Gly Ala Val
690 695 700

Ser Met Leu Lys Asn Leu Ile His Ser Lys His Lys Met Ile Ala Met
705 710 715 720

Gly Ser Ala Ala Ala Leu Arg Asn Leu Met Ala Asn Arg Pro Ala Lys
725 730 735

Tyr Lys Asp Ala Asn Ile Met Ser Pro Gly Ser Ser Leu Pro Ser Leu
740 745 750

His Val Arg Lys Gln Lys Ala Leu Glu Ala Glu Leu Asp Ala Gln His
755 760 765

Leu Ser Glu Thr Phe Asp Asn Ile Asp Asn Leu Ser Pro Lys Ala Ser
770 775 780

His Arg Ser Lys Gln Arg His Lys Gln Ser Leu Tyr Gly Asp Tyr Val
785 790 795 800

Phe Asp Thr Asn Arg His Asp Asp Asn Arg Ser Asp Asn Phe Asn Thr
805 810 815

Gly Asn Met Thr Val Leu Ser Pro Tyr Leu Asn Thr Thr Val Leu Pro
820 825 830

Ser Ser Ser Ser Ser Arg Gly Ser Leu Asp Ser Ser Arg Ser Glu Lys
835 840 845

Asp Arg Ser Leu Glu Arg Glu Arg Gly Ile Gly Leu Gly Asn Tyr His
850 855 860

Pro Ala Thr Glu Asn Pro Gly Thr Ser Ser Lys Arg Gly Leu Gln Ile
865 870 875 880

Ser Thr Thr Ala Ala Gln Ile Ala Lys Val Met Glu Glu Val Ser Ala
885 890 895

Ile His Thr Ser Gln Glu Asp Arg Ser Ser Gly Ser Thr Thr Glu Leu
900 905 910

His Cys Val Thr Asp Glu Arg Asn Ala Leu Arg Arg Ser Ser Ala Ala
915 920 925

His Thr His Ser Asn Thr Tyr Asn Phe Thr Lys Ser Glu Asn Ser Asn
930 935 940

Arg Thr Cys Ser Met Pro Tyr Ala Lys Leu Glu Tyr Lys Arg Ser Ser
945 950 955 960

Asn Asp Ser Leu Asn Ser Val Ser Ser Ser Asp Gly Tyr Gly Lys Arg
965 970 975

Gly Gln Met Lys Pro Ser Ile Glu Ser Tyr Ser Glu Asp Asp Glu Ser
980 985 990

Lys Phe Cys Ser Tyr Gly Gln Tyr Pro Ala Asp Leu Ala His Lys Ile
995 1000 1005

His Ser Ala Asn His Met Asp Asp Asn Asp Gly Glu Leu Asp Thr
1010 1015 1020

Ile Asn Tyr Ser Leu Lys Tyr Ser Asp Glu Gln Leu Asn Ser
1025 1030 1035

Gly Arg Gln Ser Pro Ser Gln Asn Glu Arg Trp Ala Arg Pro Lys
1040 1045 1050

His Ile Ile Glu Asp Glu Ile Lys Gln Ser Glu Gln Arg Gln Ser
1055 1060 1065

Arg Asn Gln Ser Thr Thr Tyr Pro Val Tyr Thr Glu Ser Thr Asp
1070 1075 1080

Asp Lys His Leu Lys Phe Gln Pro His Phe Gly Gln Gln Glu Cys
1085 1090 1095

Val Ser Pro Tyr Arg Ser Arg Gly Ala Asn Gly Ser Glu Thr Asn
1100 1105 1110

Arg Val Gly Ser Asn His Gly Ile Asn Gln Asn Val Ser Gln Ser
1115 1120 1125

Leu Cys Gln Glu Asp Asp Tyr Glu Asp Asp Lys Pro Thr Asn Tyr
1130 1135 1140

Ser Glu Arg Tyr Ser Glu Glu Glu Gln His Glu Glu Glu Glu Arg
1145 1150 1155

Pro Thr Asn Tyr Ser Ile Lys Tyr Asn Glu Glu Lys Arg His Val
1160 1165 1170

Asp Gln Pro Ile Asp Tyr Ser Leu Lys Tyr Ala Thr Asp Ile Pro
1175 1180 1185

Ser Ser Gln Lys Gln Ser Phe Ser Phe Ser Lys Ser Ser Ser Gly
1190 1195 1200

Gln Ser Ser Lys Thr Glu His Met Ser Ser Ser Ser Glu Asn Thr
1205 1210 1215

Ser Thr Pro Ser Ser Asn Ala Lys Arg Gln Asn Gln Leu His Pro
1220 1225 1230

Ser Ser Ala Gln Ser Arg Ser Gly Gln Pro Gln Lys Ala Ala Thr
1235 1240 1245

Cys Lys Val Ser Ser Ile Asn Gln Glu Thr Ile Gln Thr Tyr Cys
1250 1255 1260

Val Glu Asp Thr Pro Ile Cys Phe Ser Arg Cys Ser Ser Leu Ser
1265 1270 1275

Ser Leu Ser Ser Ala Glu Asp Glu Ile Gly Cys Asn Gln Thr Thr
1280 1285 1290

Gln Glu Ala Asp Ser Ala Asn Thr Leu Gln Ile Ala Glu Ile Lys
1295 1300 1305

Glu Lys Ile Gly Thr Arg Ser Ala Glu Asp Pro Val Ser Glu Val
1310 1315 1320

Pro Ala Val Ser Gln His Pro Arg Thr Lys Ser Ser Arg Leu Gln
1325 1330 1335

Gly Ser Ser Leu Ser Ser Glu Ser Ala Arg His Lys Ala Val Glu
1340 1345 1350

Phe Ser Ser Gly Ala Lys Ser Pro Ser Lys Ser Gly Ala Gln Thr
1355 1360 1365

Pro Lys Ser Pro Pro Glu His Tyr Val Gln Glu Thr Pro Leu Met
1370 1375 1380

Phe Ser Arg Cys Thr Ser Val Ser Ser Leu Asp Ser Phe Glu Ser
1385 1390 1395

Arg Ser Ile Ala Ser Ser Val Gln Ser Glu Pro Cys Ser Gly Met
1400 1405 1410

Val Ser Gly Ile Ile Ser Pro Ser Asp Leu Pro Asp Ser Pro Gly
1415 1420 1425

Gln Thr Met Pro Pro Ser Arg Ser Lys Thr Pro Pro Pro Pro Pro
1430 1435 1440

Gln Thr Ala Gln Thr Lys Arg Glu Val Pro Lys Asn Lys Ala Pro
1445 1450 1455

Thr Ala Glu Lys Arg Glu Ser Gly Pro Lys Gln Ala Ala Val Asn
1460 1465 1470

Ala Ala Val Gln Arg Val Gln Val Leu Pro Asp Ala Asp Thr Leu
1475 1480 1485

Leu His Phe Ala Thr Glu Ser Thr Pro Asp Gly Phe Ser Cys Ser
1490 1495 1500

Ser Ser Leu Ser Ala Leu Ser Leu Asp Glu Pro Phe Ile Gln Lys
1505 1510 1515

Asp Val Glu Leu Arg Ile Met Pro Pro Val Gln Glu Asn Asp Asn
1520 1525 1530

Gly Asn Glu Thr Glu Ser Glu Gln Pro Lys Glu Ser Asn Glu Asn
1535 1540 1545

Gln Glu Lys Glu Ala Glu Lys Thr Ile Asp Ser Glu Lys Asp Leu
1550 1555 1560

Leu Asp Asp Ser Asp Asp Asp Asp Ile Glu Ile Leu Glu Glu Cys
1565 1570 1575

Ile Ile Ser Ala Met Pro Thr Lys Ser Ser Arg Lys Ala Lys Lys
1580 1585 1590

Pro Ala Gln Thr Ala Ser Lys Leu Pro Pro Pro Val Ala Arg Lys
1595 1600 1605

Pro Ser Gln Leu Pro Val Tyr Lys Leu Leu Pro Ser Gln Asn Arg
1610 1615 1620

Leu Gln Pro Gln Lys His Val Ser Phe Thr Pro Gly Asp Asp Met
1625 1630 1635

Pro Arg Val Tyr Cys Val Glu Gly Thr Pro Ile Asn Phe Ser Thr
1640 1645 1650

Ala Thr Ser Leu Ser Asp Leu Thr Ile Glu Ser Pro Pro Asn Glu
1655 1660 1665

Leu Ala Ala Gly Glu Gly Val Arg Gly Gly Ala Gln Ser Gly Glu
1670 1675 1680

Phe Glu Lys Arg Asp Thr Ile Pro Thr Glu Gly Arg Ser Thr Asp
1685 1690 1695

Glu Ala Gln Gly Gly Lys Thr Ser Ser Val Thr Ile Pro Glu Leu
1700 1705 1710

Asp Asp Asn Lys Ala Glu Glu Gly Asp Ile Leu Ala Glu Cys Ile
1715 1720 1725

Asn Ser Ala Met Pro Lys Gly Lys Ser His Lys Pro Phe Arg Val
1730 1735 1740

Lys Lys Ile Met Asp Gln Val Gln Gln Ala Ser Ala Ser Ser Ser
1745 1750 1755

Ala Pro Asn Lys Asn Gln Leu Asp Gly Lys Lys Lys Lys Pro Thr
1760 1765 1770

Ser Pro Val Lys Pro Ile Pro Gln Asn Thr Glu Tyr Arg Thr Arg
1775 1780 1785

Val Arg Lys Asn Ala Asp Ser Lys Asn Asn Leu Asn Ala Glu Arg
1790 1795 1800

Val Phe Ser Asp Asn Lys Asp Ser Lys Lys Gln Asn Leu Lys Asn
1805 1810 1815

Asn Ser Lys Asp Phe Asn Asp Lys Leu Pro Asn Asn Glu Asp Arg
1820 1825 1830

Val Arg Gly Ser Phe Ala Phe Asp Ser Pro His His Tyr Thr Pro
1835 1840 1845

Ile Glu Gly Thr Pro Tyr Cys Phe Ser Arg Asn Asp Ser Leu Ser
1850 1855 1860

Ser Leu Asp Phe Asp Asp Asp Asp Val Asp Leu Ser Arg Glu Lys
1865 1870 1875

Ala Glu Leu Arg Lys Ala Lys Glu Asn Lys Glu Ser Glu Ala Lys
1880 1885 1890

Val Thr Ser His Thr Glu Leu Thr Ser Asn Gln Gln Ser Ala Asn
1895 1900 1905

Lys Thr Gln Ala Ile Ala Lys Gln Pro Ile Asn Arg Gly Gln Pro
1910 1915 1920

Lys Pro Ile Leu Gln Lys Gln Ser Thr Phe Pro Gln Ser Ser Lys
1925 1930 1935

Asp Ile Pro Asp Arg Gly Ala Ala Thr Asp Glu Lys Leu Gln Asn
1940 1945 1950

Phe Ala Ile Glu Asn Thr Pro Val Cys Phe Ser His Asn Ser Ser
1955 1960 1965

Leu Ser Ser Leu Ser Asp Ile Asp Gln Glu Asn Asn Asn Lys Glu
1970 1975 1980

Asn Glu Pro Ile Lys Glu Thr Glu Pro Pro Asp Ser Gln Gly Glu
1985 1990 1995

Pro Ser Lys Pro Gln Ala Ser Gly Tyr Ala Pro Lys Ser Phe His
2000 2005 2010

Val Glu Asp Thr Pro Val Cys Phe Ser Arg Asn Ser Ser Leu Ser
2015 2020 2025

Ser Leu Ser Ile Asp Ser Glu Asp Asp Leu Leu Gln Glu Cys Ile
2030 2035 2040

Ser Ser Ala Met Pro Lys Lys Lys Lys Pro Ser Arg Leu Lys Gly
2045 2050 2055

Asp Asn Glu Lys His Ser Pro Arg Asn Met Gly Gly Ile Leu Gly
2060 2065 2070

Glu Asp Leu Thr Leu Asp Leu Lys Asp Ile Gln Arg Pro Asp Ser
2075 2080 2085

Glu His Gly Leu Ser Pro Asp Ser Glu Asn Phe Asp Trp Lys Ala
2090 2095 2100

Ile Gln Glu Gly Ala Asn Ser Ile Val Ser Ser Leu His Gln Ala
2105 2110 2115

Ala Ala Ala Ala Cys Leu Ser Arg Gln Ala Ser Ser Asp Ser Asp
2120 2125 2130

Ser Ile Leu Ser Leu Lys Ser Gly Ile Ser Leu Gly Ser Pro Phe
2135 2140 2145

His Leu Thr Pro Asp Gln Glu Glu Lys Pro Phe Thr Ser Asn Lys
2150 2155 2160

Gly Pro Arg Ile Leu Lys Pro Gly Glu Lys Ser Thr Leu Glu Thr
2165 2170 2175

Lys Lys Ile Glu Ser Glu Ser Lys Gly Ile Lys Gly Gly Lys Lys
2180 2185 2190

Val Tyr Lys Ser Leu Ile Thr Gly Lys Val Arg Ser Asn Ser Glu
2195 2200 2205

Ile Ser Gly Gln Met Lys Gln Pro Leu Gln Ala Asn Met Pro Ser
2210 2215 2220

Ile Ser Arg Gly Arg Thr Met Ile His Ile Pro Gly Val Arg Asn
2225 2230 2235

Ser Ser Ser Ser Thr Ser Pro Val Ser Lys Lys Gly Pro Pro Leu
2240 2245 2250

Lys Thr Pro Ala Ser Lys Ser Pro Ser Glu Gly Gln Thr Ala Thr
2255 2260 2265

Thr Ser Pro Arg Gly Ala Lys Pro Ser Val Lys Ser Glu Leu Ser
2270 2275 2280

Pro Val Ala Arg Gln Thr Ser Gln Ile Gly Gly Ser Ser Lys Ala
2285 2290 2295

Pro Ser Arg Ser Gly Ser Arg Asp Ser Thr Pro Ser Arg Pro Ala
2300 2305 2310

Gln Gln Pro Leu Ser Arg Pro Ile Gln Ser Pro Gly Arg Asn Ser
2315 2320 2325

Ile Ser Pro Gly Arg Asn Gly Ile Ser Pro Pro Asn Lys Leu Ser
2330 2335 2340

Gln Leu Pro Arg Thr Ser Ser Pro Ser Thr Ala Ser Thr Lys Ser
2345 2350 2355

Ser Gly Ser Gly Lys Met Ser Tyr Thr Ser Pro Gly Arg Gln Met
2360 2365 2370

Ser Gln Gln Asn Leu Thr Lys Gln Thr Gly Leu Ser Lys Asn Ala
2375 2380 2385

Ser Ser Ile Pro Arg Ser Glu Ser Ala Ser Lys Gly Leu Asn Gln
2390 2395 2400

Met Asn Asn Gly Asn Gly Ala Asn Lys Lys Val Glu Leu Ser Arg
2405 2410 2415

Met Ser Ser Thr Lys Ser Ser Gly Ser Glu Ser Asp Arg Ser Glu
2420 2425 2430

Arg Pro Val Leu Val Arg Gln Ser Thr Phe Ile Lys Glu Ala Pro
2435 2440 2445

Ser Pro Thr Leu Arg Arg Lys Leu Glu Glu Ser Ala Ser Phe Glu
2450 2455 2460

Ser Leu Ser Pro Ser Ser Arg Pro Ala Ser Pro Thr Arg Ser Gln
2465 2470 2475

Ala Gln Thr Pro Val Leu Ser Pro Ser Leu Pro Asp Met Ser Leu
2480 2485 2490

Ser Thr His Ser Ser Val Gln Ala Gly Gly Trp Arg Lys Leu Pro
2495 2500 2505

Pro Asn Leu Ser Pro Thr Ile Glu Tyr Asn Asp Gly Arg Pro Ala
2510 2515 2520

Lys Arg His Asp Ile Ala Arg Ser His Ser Glu Ser Pro Ser Arg
2525 2530 2535

Leu Pro Ile Asn Arg Ser Gly Thr Trp Lys Arg Glu His Ser Lys
2540 2545 2550

His Ser Ser Ser Leu Pro Arg Val Ser Thr Trp Arg Arg Thr Gly
2555 2560 2565

Ser Ser Ser Ser Ile Leu Ser Ala Ser Ser Glu Ser Ser Glu Lys
2570 2575 2580

Ala Lys Ser Glu Asp Glu Lys His Val Asn Ser Ile Ser Gly Thr
2585 2590 2595

Lys Gln Ser Lys Glu Asn Gln Val Ser Ala Lys Gly Thr Trp Arg
2600 2605 2610

Lys Ile Lys Glu Asn Glu Phe Ser Pro Thr Asn Ser Thr Ser Gln
2615 2620 2625

Thr Val Ser Ser Gly Ala Thr Asn Gly Ala Glu Ser Lys Thr Leu
2630 2635 2640

Ile Tyr Gln Met Ala Pro Ala Val Ser Lys Thr Glu Asp Val Trp
2645 2650 2655

Val Arg Ile Glu Asp Cys Pro Ile Asn Asn Pro Arg Ser Gly Arg
2660 2665 2670

Ser Pro Thr Gly Asn Thr Pro Pro Val Ile Asp Ser Val Ser Glu
2675 2680 2685

Lys Ala Asn Pro Asn Ile Lys Asp Ser Lys Asp Asn Gln Ala Lys
2690 2695 2700

Gln Asn Val Gly Asn Gly Ser Val Pro Met Arg Thr Val Gly Leu
2705 2710 2715

Glu Asn Arg Leu Asn Ser Phe Ile Gln Val Asp Ala Pro Asp Gln
2720 2725 2730

Lys Gly Thr Glu Ile Lys Pro Gly Gln Asn Asn Pro Val Pro Val
2735 2740 2745

Ser Glu Thr Asn Glu Ser Ser Ile Val Glu Arg Thr Pro Phe Ser
2750 2755 2760

Ser Ser Ser Ser Ser Lys His Ser Ser Pro Ser Gly Thr Val Ala
2765 2770 2775

Ala Arg Val Thr Pro Phe Asn Tyr Asn Pro Ser Pro Arg Lys Ser
2780 2785 2790

Ser Ala Asp Ser Thr Ser Ala Arg Pro Ser Gln Ile Pro Thr Pro
2795 2800 2805

Val Asn Asn Asn Thr Lys Lys Arg Asp Ser Lys Thr Asp Ser Thr
2810 2815 2820

Glu Ser Ser Gly Thr Gln Ser Pro Lys Arg His Ser Gly Ser Tyr
2825 2830 2835

Leu Val Thr Ser Val
2840



<210> 44 

<211> 2121 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> c-myc 

<400> 44 

ctgctcgcgg ccgccaccgc cgggccccgg ccgtccctgg ctcccctcct gcctcgagaa 60
gggcagggct tctcagaggc ttggcgggaa aaaagaacgg agggagggat cgcgctgagt 120
ataaaagccg gttttcgggg ctttatctaa ctcgctgtag taattccagc gagaggcaga 180
gggagcgagc gggcggccgg ctagggtgga agagccgggc gagcagagct gcgctgcggg 240
cgtcctggga agggagatcc ggagcgaata gggggcttcg cctctggccc agccctcccg 300
cttgatcccc caggccagcg gtccgcaacc cttgccgcat ccacgaaact ttgcccatag 360
cagcgggcgg gcactttgca ctggaactta caacacccga gcaaggacgc gactctcccg 420
acgcggggag gctattctgc ccatttgggg acacttcccc gccgctgcca ggacccgctt 480
ctctgaaagg ctctccttgc agctgcttag acgctggatt tttttcgggt agtggaaaac 540
cagcagcctc ccgcgacgat gcccctcaac gttagcttca ccaacaggaa ctatgacctc 600
gactacgact cggtgcagcc gtatttctac tgcgacgagg aggagaactt ctaccagcag 660
cagcagcaga gcgagctgca gcccccggcg cccagcgagg atatctggaa gaaattcgag 720
ctgctgccca ccccgcccct gtcccctagc cgccgctccg ggctctgctc gccctcctac 780
gttgcggtca cacccttctc ccttcgggga gacaacgacg gcggtggcgg gagcttctcc 840
acggccgacc agctggagat ggtgaccgag ctgctgggag gagacatggt gaaccagagt 900
ttcatctgcg acccggacga cgagaccttc atcaaaaaca tcatcatcca ggactgtatg 960
tggagcggct tctcggccgc cgccaagctc gtctcagaga agctggcctc ctaccaggct 1020
gcgcgcaaag acagcggcag cccgaacccc gcccgcggcc acagcgtctg ctccacctcc 1080
agcttgtacc tgcaggatct gagcgccgcc gcctcagagt gcatcgaccc ctcggtggtc 1140
ttcccctacc ctctcaacga cagcagctcg cccaagtcct gcgcctcgca agactccagc 1200
gccttctctc cgtcctcgga ttctctgctc tcctcgacgg agtcctcccc gcagggcagc 1260
cccgagcccc tggtgctcca tgaggagaca ccgcccacca ccagcagcga ctctgaggag 1320
gaacaagaag atgaggaaga aatcgatgtt gtttctgtgg aaaagaggca ggctcctggc 1380
aaaaggtcag agtctggatc accttctgct ggaggccaca gcaaacctcc tcacagccca 1440

ctggtcctca agaggtgcca cgtctccaca catcagcaca actacgcagc gcctccctcc 1500 

actcggaagg actatcctgc tgccaagagg gtcaagttgg acagtgtcag agtcctgaga 1560 

cagatcagca acaaccgaaa atgcaccagc cccaggtcct cggacaccga ggagaatgtc 1620 

aagaggcgaa cacacaacgt cttggagcgc cagaggagga acgagctaaa acggagcttt 1680 

tttgccctgc gtgaccagat cccggagttg gaaaacaatg aaaaggcccc caaggtagtt 1740 

atccttaaaa aagccacagc atacatcctg tccgtccaag cagaggagca aaagctcatt 1800 

tctgaagagg acttgttgcg gaaacgacga gaacagttga aacacaaact tgaacagcta 1860 

cggaactctt gtgcgtaagg aaaagtaagg aaaacgattc cttctaacag aaatgtcctg 1920 

agcaatcacc tatgaacttg tttcaaatgc atgatcaaat gcaacctcac aaccttggct 1980 

gagtcttgag actgaaagat ttagccataa tgtaaactgc ctcaaattgg actttgggca 2040 

taaaagaact tttttatgct taccatcttt tttttttctt taacagattt gtatttaaga 2100 

attgttttta aaaaatttta a 2121 

<210> 45 

<211> 439 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> c-myc



Met Pro Leu Asn Val Ser Phe Thr Asn Arg Asn Tyr Asp Leu Asp Tyr
1 5 10 15

Asp Ser Val Gln Pro Tyr Phe Tyr Cys Asp Glu Glu Glu Asn Phe Tyr
20 25 30

Gln Gln Gln Gln Gln Ser Glu Leu Gln Pro Pro Ala Pro Ser Glu Asp
35 40 45

Ile Trp Lys Lys Phe Glu Leu Leu Pro Thr Pro Pro Leu Ser Pro Ser
50 55 60

Arg Arg Ser Gly Leu Cys Ser Pro Ser Tyr Val Ala Val Thr Pro Phe
65 70 75 80

Ser Leu Arg Gly Asp Asn Asp Gly Gly Gly Gly Ser Phe Ser Thr Ala
85 90 95

Asp Gln Leu Glu Met Val Thr Glu Leu Leu Gly Gly Asp Met Val Asn
100 105 110

Gln Ser Phe Ile Cys Asp Pro Asp Asp Glu Thr Phe Ile Lys Asn Ile
115 120 125

Ile Ile Gln Asp Cys Met Trp Ser Gly Phe Ser Ala Ala Ala Lys Leu
130 135 140

Val Ser Glu Lys Leu Ala Ser Tyr Gln Ala Ala Arg Lys Asp Ser Gly
145 150 155 160

Ser Pro Asn Pro Ala Arg Gly His Ser Val Cys Ser Thr Ser Ser Leu
165 170 175

Tyr Leu Gln Asp Leu Ser Ala Ala Ala Ser Glu Cys Ile Asp Pro Ser
180 185 190

Val Val Phe Pro Tyr Pro Leu Asn Asp Ser Ser Ser Pro Lys Ser Cys
195 200 205

Ala Ser Gln Asp Ser Ser Ala Phe Ser Pro Ser Ser Asp Ser Leu Leu
210 215 220

Ser Ser Thr Glu Ser Ser Pro Gln Gly Ser Pro Glu Pro Leu Val Leu
225 230 235 240

His Glu Glu Thr Pro Pro Thr Thr Ser Ser Asp Ser Glu Glu Glu Gln
245 250 255

Glu Asp Glu Glu Glu Ile Asp Val Val Ser Val Glu Lys Arg Gln Ala
260 265 270

Pro Gly Lys Arg Ser Glu Ser Gly Ser Pro Ser Ala Gly Gly His Ser
275 280 285

Lys Pro Pro His Ser Pro Leu Val Leu Lys Arg Cys His Val Ser Thr
290 295 300

His Gln His Asn Tyr Ala Ala Pro Pro Ser Thr Arg Lys Asp Tyr Pro
305 310 315 320

Ala Ala Lys Arg Val Lys Leu Asp Ser Val Arg Val Leu Arg Gln Ile
325 330 335

Ser Asn Asn Arg Lys Cys Thr Ser Pro Arg Ser Ser Asp Thr Glu Glu
340 345 350

Asn Val Lys Arg Arg Thr His Asn Val Leu Glu Arg Gln Arg Arg Asn
355 360 365

Glu Leu Lys Arg Ser Phe Phe Ala Leu Arg Asp Gln Ile Pro Glu Leu
370 375 380

Glu Asn Asn Glu Lys Ala Pro Lys Val Val Ile Leu Lys Lys Ala Thr
385 390 395 400

Ala Tyr Ile Leu Ser Val Gln Ala Glu Glu Gln Lys Leu Ile Ser Glu
405 410 415

Glu Asp Leu Leu Arg Lys Arg Arg Glu Gln Leu Lys His Lys Leu Glu
420 425 430

Gln Leu Arg Asn Ser Cys Ala
435

<210> 46 

<211> 11 

<212> PRT 

<213> HIV 



<220> 

<221> misc_feature 
<223> TAT protein 



Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg
1 5 10

<210> 47 

<211> 54 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic primer 
<220> 

<221> misc_feature 

<223> Prox-1 sense

<400> 47 

tggtcatctg caagctggat ttcaagagaa tccagcttgc agatgacctt tttc 54

<210> 48 

<211> 58 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic primer
<220>

<221> misc_feature

<223> Prox-1 anti-sense

<400> 48 

tcgagaaaaa aggtcatctg caagctggat tctcttgaaa tccagcttgc agtgacca 58

<210> 49

<211> 55

<212> DNA

<213> Artificial sequence
<220>

<223> Synthetic primer 
<220> 

<221> misc_feature 
<223> Prox-2 sense 

<400> 49 

tgagccagtt tgatatggat ttcaagagaa tccatatcaa actggctctt ttttc 55 



<210> 50 

<211> 58 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic primer 
<220> 

<221> misc_feature 

<223> Prox-2 anti-sense

<400> 50 

tcgagaaaaa agagccagtt tgatatggat tctcttgaaa tccatatcaa actgctca 58